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On (57) Abstract: Modified bacterial surface layer (S-layer) proteins arc disclosed where the modification is the insertion, at an internal 
location, of a heterologous polypeptide, or polypeptide of interest. The polypeptide is a binding or target protein, such as an antigen 

^ or antibody, or part thereof, in particular a bacterial antigen (e.g. from Clostridium tetani such as TTFC). The modified surface 
layer protein can then be expressed on the surface of the bacterial cell and used in a vaccine. Also disclosed are bacteria which 
^^^^ modified to express a heterologous surface layer protein, but which do not as a wild-type possess an S layer (such as L. 
casei), and modified bacteria which express only a modified surface layer protein (and not the wild-type S-layer protein). The wild 

^5 ^yP^ S-layer is completely replaced with the modified version where the polynucleotide encoding the modified version is integrated 
into the bacterial genome. The modified S-proteins can form crystalline arrays, sheets or layers that can be used to bind functional 
molecules (e.g. receptors) to solid surfaces (Au, silicon wafers) in biosensors. 
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MODIFIED BACTERIAL SURFACE LAYER PROTEINS 



Introduction 

The present invention relates to modified bacterial surface layer (S-proteins) 
that can be used in vaccines (when expressed on the surface of a cell), molecular 
sieves and sensors, and bacteria expressing such S-proteins. The S-layer proteins 
have inserted into them, at an internal position, one or more heterologous (or 
functional) polypeptides. Such a polypeptide may be an antigen or an antibody, or a 
portion thereof. Other modifications include the truncation of the carboxy (C) or 
amino (N) terminus to form fiagments. 

Background 

Bacterial surface layer proteins are known to exist in many different types of 
bacteria, and often assemble into crystalline layers at the cell surface. However, the 
exact function of these proteins is not always known, although they are thought to 
play a role in bacterial adhesion to other cells, or to act as a scaffold for enzymes. 

Lactobacillus is one of the genera belonging to the group of lactic acid 
bacteria (LAB) which are used in the manufacture of fermented foods and beverages, 
and are generally considered to be safe. Bacteria of the genus Lactobacillus can 
protect humans against infection, and this property (and their ability to colonise 
mucosal surfaces) has resulted in the use of Lactobacilli as vaccine delivery vehicles 
for oral immunisation. The adhesion to the mucosa is thought to be a pre-requisite 
for the survival and establishment in the gastrointestinal tract (GIT). Some surface 
located molecules have been identified as adhesions fliat specifically interact with 
receptor moieties in the internal tissue. 

The surface layer of proteins self-assembles into multimeric units to form an 
array covering the entire cell as the outermost envelope. They are single protein 
molecules and form crystalline monolayers. However, the adhesive properties of S- 
layers, especially in probiotic Lactobacilli, remains poorly characterised. 
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S-layers have been found in SQvetzl Lactobacillus species, and although the 
function of these S-proteins is not known, they are thought to play a role in cell 
adhesion. S-layer proteins oi Lactobacillus are amongst the smallest S-proteins 
known, and are different from most other bacterial S-proteins because tihiey are highly 
basic (pi of at least 9) instead of acidic proteins. 

Several S-layer proteins have been sequenced. These include the rsoA gene 
of C crescentus (US 5500353, Smit et al), ^nA Lactobacillus acidophilus (Boot et a/, 
1993 and Smit et al, 2001). Although the sequence of the S-protein is given, the 
secondary and tertiary structure of the protein cannot be predicted simply from 
knowing the amino acid sequence. Indeed, from that document it is impossible to 
predict, for example, which amino acid sequence might be exposed to the surface, 
and which amino acids might be present in the S-layer. 

In addition, the cbsA gene encoding the collagen binding S-layer protein of 
Lactobacillus crispatus has been e>q)ressed in L. casei. (Martinez et al, Joumal of 
Bacteriology 182 (23): 6857-6861, December 2000). 

As the structure and role of surface layer proteins is not well understood, 
there is therefore a need to detemiine a structure/function relationship, in order to 
allow the production of modified surface layer proteins which have improved 
properties over the wild-type. Surface layer proteins represent a large proportion of 
protein in the cell (as much as 10 to 12%) and may therefore be used to express large 
quantities of a heterologous protein. In addition there is a need for more effective 
(oral) vaccines, in particular those based on bacteria (or Lactobacillus) where a 
greater amount or higher density of antigen can be presented on the cell, or may be 
better presented at the cell surface. 

Summary of the Invention 

A first aspect of the present invention therefore relates to a (e.g. modified) 
bacterial surface layer (S-layer) protein, the modification comprising the insertion, 
intemally, of a heterologous polypeptide. Such a polypq)tide may be a functional 
one, and can be any polypeptide of interest. 
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The modification is the insertion of the polypeptide at an internal location. 
That is to say, the insertion is at an internal sequence, which is of course differmt 
fronoL the addition (or fusion) of a polypeptide at either the carboxy (C) or amino (N) 
terminus. The insertion may take place at least 5, 10, 20, 30 or 40 amino acids from 
either end of the protein. 

The polypeptide may be inserted (in the sense that it slots in between two 
adjacent amino acid residues) rather than replacing any residues (where residues are 
deleted or substituted). 

Location of inserted heterologous polypeptide 

The polypeptide may be inserted at a location whereby the polypeptide is 
exposed, or away from the cell surface when the surface layer protein is expressed (at 
the surface) of a bacterial cell. The heterologous polypeptide may therefore be 
exposed in the sense that it can be targeted or bound by a compound or other 
polypeptide (for example an antibody). This may be possible if the polypeptide is in 
a portion of the S-layer protein which is involved in or responsible for crystallisation 
and/or assembly, for example is part of the crystalUsation or assembly domain. 

However, the alternative is possible, where the heterologous polypeptide is 
not e3q)osed, for example it is located (at least partly in) in the cell surface, or in the 
S-layer. This may be possible by inserting the heterologous peptide at a location at 
or near a cell wall anchor or is in an attachment or binding domain. This part of the 
molecule may be in the S-layer. In such case the heterologous peptide may be 
present or buried in the surface layer. It may thus be protected from the outside 
environment, for example shielded from proteolytic processing. It may not be bound 
or recognised by external antibodies or shielded from external degradation of 
proteolytic attack. This may be advantageous if the protein is to be located on a 
bacterial cell wall, and then administered to a human, as it may mitigate or avoid 
degradation, or proteolytic processing, by enzymes in the gastrointestinal tract (GIT). 

For the S-layer protein of Lactobacillus acidophilus^ two main domains have 
been idaitified. The first N-terminal region, which constitutes about two thirds of 
the molecule, is involved in crystallisation and assembly (and is called SAN). It 
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constitutes amino acids 1 to 290 and effectively forms the S-layer above the cell 
wall. The second main domain, or last third of tiie molecule, amino acids 290 to 412, 
represents the cell wall anchor (and is called SAC). This portion of the molecule is 
buried in the S-layer. 

The N-terminal assembly region can in itself be split into three main portions; 
1 to 1 14 (involved in forming dimers and trimers); 150 to 290 (involved in forming 
dimers); these flank an intervening domain (about 115 or 120 to 150 or 160) that 
appears to be loop region (or a surface (immunodominant) loop) that is exposed and 
can be targeted by antibodies. 

Therefore, if the heterologous polypeptide is to be exposed, thCTi the insertion 
can be at a location of from any of amino acids 1 to 290, and preferably from 1 15 or 
120 to 150 or 160 (as this appears to be a surface exposed loop region). The N- 
terminal crystalUsation section or assembly region may be primarily hydrophobic, 
such as in Z. acidophilus. This is unexpected, and counter-intuitive, given that this is 
a region that is (surface) exposed, and present in the environment outside the 
bacteria, which is necessary in order for the S-layer proteins to assemble and form a 
layer. 

The cell wall anchor may be equally unusual, in that it is primarily 
hydrophiUc, despite being present in the S-layer. (Cell wall binding regions are 
usually hydrophilic as they bind negatively charged groups in the cell wall). The cell 
wall anchor region can alternate between hydrophilic and hydrophobic regions in 
some S-proteins. The protein may thus be inserted in a hydrophobic region (e.g. for 
siuiace exposure) even though such a region would not usually be exposed on the 
surface. 

The above two paragraphs relate mainly to Lactobacillus proteins, such as 
from L. acidophilus. For many other (S-layer) proteins the reverse is true, and in this 
case the N-terminal domain can be hydrophiUc while the C-terminal may be 
hydrophobic. 

If the heterologous polypeptide is to be exposed, then preferably it is located 
at a position from 100 to 150 or 160, such as from 1 10 to 140 or 150, preferably from 
120 to 130 or 140, optimally at aroimd position 125. Preferably the polypeptide is 
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located in a region of the S-layer protein that is e^cposed or is in a snr&ce (or 
immunodominant) loop. 

Note tiiat the numbering of residues in this specification is (unless the context 
requires otherwise) based on the numbering of the original, (mature) wild-type, 
protein. Another region for exposure of the heterologous protein is from 130 to 180, 
such as from 140 to 170, preferably from 150 to 160, optimally at around position 
156. An additional location for exposing the polypeptide, although to a lesser 
degree, is at one of the positions from 20 to 80, such as from 30 to 70, preferably 
from 40 to 50, optimally at around position 45. 

If the heterologous polypeptide is not to be exposed, or is to be at least 
partially present inside the S-layer, then the insertion can take place at a different 
location in the umnodified S-layer protein. This may be from position 1 to 30, such 
as from 3 to 20, preferably from 5 to 10, and optimally at about position 7. 

Suitably the heterologous polypeptide is inserted into the protein, rather than 
replacing any part of the protein sequence. In that way, most of (eg. at least 70, 80, 
85, 90 or even 95%) or the entire or fidl sequence of the (wild-type or original) 
protein can be retained. Hence, there may be no deletion or replacement of amino, 
acid residues (or if there are, there are no more than 5, 10, 15 or 20 deletions or 
replacements). This may help the modified protein to retain as many of the 
properties or fimctions of the original molecule as possible. The modified protein 
may be at least as long, if not longer, than the unmodified (wild-type) protein. 

Modified Proteins 

The following comments relate to both the modified (of the invention) and 
unmodified (wild type) proteins, unless the context requires otherwise. The protein 
preferably has a minimimi size of 30, 40 or 50 kDa. It may have a maximum size of 
70, 80, 100, 150 or even 200 or 300 kDa. Preferred proteins are from 40 to 50, 60, 
70 or 80 kDa in size. The protein may be able to crystallise, or with other proteins 
form a sheet or crystalline monolayer (e.g. in vitro). They may be able to self- 
assemble, for example into multimeric units. If possible, the protein may be able to 
form (with other proteins) an oblique lattice (for example of pi or p2 symmetry). 
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The protein may have at least two domains or functional regions. One may 
be involved in or responsible for crystallisation or assembly. This region may be 
e>qposed, and away from the cell membrane. The protein may therefore have a (e.g. 
surface exposed) loop region. The protein may have a cell wall anchor or cell wall 
binding region. This may be wholly or partly present in the S-layer. 

Preferably the modified protein will not comprise (or retain) a (eg. S-layer) 
secretion signal (this may have been cleaved during processing or expression). It 
may have a (C-tenninal or cell wall) anchor region, which may be the original region 
from the S-layer protein or it may be (in addition) such a region from another S-layer 
protein, such as of a different £5)ecies (preferably from the species in which the 
modified protein is to be expressed). 

The (unmodified or wild-type) protein may be from a lactic acid bacteria, 
such as from Lactobacillus. Preferably, the protein is from L, acidophilus, L. 
crispatus, L. helveticus, L. amylovorus, L. gallinarum and/or Z. brevis. Preferably 
the protein is predominantly basic. Suitably, the protein is not from a Gram negative 
or aquatic bacteria, for example of the group Caulobacter, such as from C. 
crescentus^ or from the family Campylobacteraceae (eg. those belonging to 
Campylobacter). 

The protein may be expressed in a bacterial host. If it is expressed on the 
surface, such as in the S-layer, then bacterial host cell may be used in a vaccine. The 
heterologous polypeptide may be exposed, e.g. to the enviromnent. If however the 
polypeptide is to survive inside the body (or be shielded from the environment or 
from degradation or proteolytic attack), for example in part of the GIT, tiien the 
heterologous peptide may be partially or wholly present or buried in the S-layer. 

Preferably the protein has a pi of at least 6, 7 or 8. Optimally however it has 
a pi of at least 9, such as at least 9.4. 

Insertion Sites 



The location of Ihe inserted polypeptide may effect crystallisation. Modified 
peptides which crystallise, or do not crystalhse, can both be prepared using the 
invention. The position of insertion of the polypeptide may be at fix)m 1 to 290, such 
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as at any of the following locations (if for example the protein is desired to be able to 
crystallise): 

(1) at from 1 to 20, such as 3 to 15, preferably from 5 to 10, optimally at 
about position 7; 

(2) at from 35 to 55, such as 40 to 50, optimally at about position 45; 

(3) at from 100 to 130, such as from 110 to 120, optimally at about 
position 114; and/or 

(4) at from 1 10 to 140, such as from 120 to 130, optimally at about 
position 125, 

The heterologous polypeptide can be inserted at position above residue 290, 
such as from 291 to 400, 425 or 450 (eg. about 412). If the protein is not to be 
capable of crystallising then the insertion may be: 

(1) at from 20 to 40, such as 25 to 35, optimally at about position 30; 

(2) at from 50 to 80, such as at 60 to 70, optimally at about position 66; 

(3) at from 70 to 100, such as from 80 to 94, preferably from 85 to 90, 
optimally at about position 88; and/or 

(4) at from 1 15 to 150 or 140 to 180, such as from 150 to 170 or 160 to 
180, preferably from 150 to 160 or 170 to 180, optimally at about position 
156 or 177. 

An insertion at above position 290 may not affect crystallisation as this may 
not be in a surface or exposed position (for example it may be in a cell wall anchor 
region). The insertion may thxis be: 

(5) at from 320 to 410, such as from 330 to 370, preferably from 340 to 
360, optimally at about position 349. . 

There may be one or more heterologous polypeptides, inserted at one or more 
different sites. 

Sensors and Molecular Sieves 

The protein or bacterial host cell can be employed in sensors, for example 
included in a coating. They may also find use in molecular sieves. For example, a 
plurality of bacteria, expressing a modified protein, can be arranged to form a hole or 
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to dejSne a pore. By changiiig the location and nature of the heterologous 
polypeptide, one can change not only the protein size and characteristics hut also the 
confirmation of the protein. This is turn may change the pore or hole characteristics, 
for example in certain parameters such as affinity or charge. 

A multiplicity of proteins may be able to crystallise or form 2-dimensional 
arrays on a (e.g. solid) surface (e.g. comprising silicon, polymer(s) or metal(s)) or an 
interface (e.g. a liposome or planar Upid film). An S-layer may be used as a template 
in the formation of an inorganic (e.g. nanocrystal) lattice (e.g. comprising CdS, Au, 
Ni, Pt or Pd), such as in electronics or non-linear optics. 

The functional domains on each S-protein may be repeated with the 
periodicity of the S-layer lattice. This may allow the formation of regular arrays of 
bound molecules or particles. 

The (modified) S-proteins of the invention may interact with each other, and 
if appropriate, with a supporting cell envelope layer (such as plasma membrane, 
outer membrane or peptidoglycan). The protein may have a high proportion of non- 
polar amino acids, so that hydrophobic interactions play a role in assembly. 
Preferably, a multiplicity of S-proteins have the capability to reassemble, for 
example in suspension, at a liquid air interface, at a solid surface, at a floating Upid 
monolayer or on (the outside of) a hposome. The S-proteins may therefore form a 
monolayer, at a sxurface or an interface, and this allows them to be used in 
nanobiotechnological appHcations. 

A further aspect of the invention therefore relates to a sheet or (optionally 
crystalline) monolayer or two dimensional array comprising a polarity of bacterial 
surface layer proteins. At least one of such proteins is preferably a modified protein 
according to the first aspect. 

The invention therefore stems, in a further aspect, to a solid surface, liquid-air 
interface, lipofilm, liposome or solution comprising a sheet, monolayer or array of 
the previous aspect. 

Preferably, a multiplicity of the S-proteins may be disintegrated by a 
chaotropic agent, by lowering or raising the pH value, or by addition of a chelating 
agent or cation substitution. Preferably, after removal of the dismpting agent, the 
proteins will reassemble into a flat sheet, an open-ended cylinder or a closed vesicle. 
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Any of these arrays may be a double layer, or bi-layer, for example with two 
constituent monolayers facing each other (either with their iimer or outer side). Bi- 
layers may, if necessary, contain divalent cations interacting with polar groups. 

The S-proteins may be able to reassemble at an air-water interface, on a solid 
support, or in a lipid film. In this way, a (randomly aligned) monocrystalUne S-layer 
may be formed. If formation takes place at a liquid-solid interface, then the solid 
may comprise silicon, such as a silicon wafer. The siUcon wafer may have a native 
oxide layer, it may be silanised, or it may be coated with a photoresist and/or carbon 
film. Other solids can be employed, such as silicone nitride, galium arsenide, noble 
metals (gold or platinum), glass, cellulose, graphite or mica. 

Preferably, the S-layer can be pattemed using deep ultra-violet (DUV) 
radiation. The S-layer, such as on a silicon wafer, may be brought into (direct) 
contact with a photomask, and exposed to DUV radiation. The S-layer may be 
removed firom the silicon wafer or substrate at the e:q)osed areas. Thus, a pattern on 
the photomask may be transferred into the S-layer. 

The S-layer protein (or sheet or array) may be used to immobilise a 
biologically active (macro) molecule. Molecules such as enzymes, antibodies, 
antigens and/or hgands may be covalently immobilised to an S-protein (mono) layer. 

A layer of S-proteins may therefore act as a surface linking layer. The 
S-protein layer may be inbetween, or sandwiched between fimctional molecules and 
a solid surface or support. The fimctional molecule may be an enzyme, antibody (or 
firagment thereof), or other binding molecule, a receptor, antigen or Ugand, any of 
which may act as a bioreceptor (such as for a target, in a sample). The S-layer 
proteins may be self-assembling, and they may be capable of recrystallising at the 
surface. They may thus provide a linking layer between the soUd surface and the 
fimctional molecules. This type of arrangement may be used in a (bio) sensor, and 
the invention therefore relates to sensors, molecular sieves and ion traps comprising 
an S-protein layer of the invention. 
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The invention additionally relates to modified proteins that are fi-agments of 
bacterial S-layer proteins. The fragments may be N-teraiinal fragments, 
Altematively or in addition, the fragments may be capable of forming trimers, for 
example with two other such fragments. The fragment may be able to fomi 
multimers (e.g. trimers) with different fragments, or indeed the wild type molecule as 
well. These fragments can be thought of as C-terminal deletions because, as they 
retain a portion of the N-terminus, they can be prepared by, in effect, deleting or 
removing all or a portion of the C-temiinus. 

The fragments may be capable of forming dimers, preferably with another 
such fragment (although the formation of dimers with other fragments or the fiiU size 
wild-type molecule may also be possible). Such Augments may include the 
(immunodominant or e^osed) loop region. The fragment may then be from 20 to 
200 amino acids long. Another type of fragment may be one that excludes the entire 
(immunodominant or exposed) loop region. This may allow for at least part of the 
loop region to be present. Such fragments may be from 20, 30 or 40 to 100, 130 or 
155 amino acids long. 

The loop region may be a region that, in the ftill length of wild-type 
molecule, is exposed to the environment when expressed on the surface of a 
bacterium. It may be e3q)0sed in the sense that it may be bound or targeted by an 
(external) antibody specific for that region. The N-temiinal fragments or those that 
are capable of trimerisation may include part or all of the loop region. The fragments 
may thus be from 10 to 30, 40 or 50 amino acids long. For certain bacteria, such as 
L. acidophilic^ this region may be from 20 to 40 amino acids in lengfla, such as from 
32 to 38 amino acids. 

Preferably the fragment is at least 40, 60, 80, 120, 130 or at least 140 amino 
acids in length. The fragments may be up to 170, 150, or 120 amino acids in length. 

If the fragments are fragments of an S-layer protein from Lactobacillus, such 
as Ir. acidophilitSy the fragments are suitably fragments of the assembly or 
crystallisation region, known as SAN in the art, namely residues 1 to 290. 
Particularly preferred fragments (using the numbering of the residues of 
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L. acidophilus S-layer protein) are 1-159, 1-149, 1-114, 115-290, 139-290 and 
150-290. Preferably the fragments will not contain aay part of SAC or the cell wall 
anchor region. 

Novel (Bacterial) Cells 

A second aspect of the invention relates to a (e.g. modified) bacteria, or 
bacterial cell, which has been modified to express a heterologous surface layer 
(S-layer) protein. Preferably, the bacterium is other than an L. casei cell or a 
Bacillus (e.g. B, sphaericus or 5. brevis) cell. Preferably the bacterial cell is 
however Lactobacillus cell or from the family Lactobacillaceae. The S-layer protein 
may have its own (or original) cell wall anchor. In other words, the cell wall anchor 
is homogenous to the S-layer protein, or the protein retains its natural cell wall 
anchor domain, or region. Preferably, the S-layw protein has a cell wall anchor 
which is from the same bacterial species as the S-layer protein. Such a bacteria, as a 
wild-type, or in unmodified form, may not possess such a surface layer. 

Preferably the (host) cell expresses, or contains a gene encoding, from 1 to 6 
different surface layer proteins (either naturally, as a wild type, or in a modified 
form). 

The S-layer protein is preferably one of the modified bacterial surface layer 
proteins of the first aspect, although it need not be. For example, the bacterial cell 
may be one which does not (normally) have an S-layer, (for example, Lactobacillus 
reuteri, Lactobacillus fermentum, Lactobacillus salivarius, Lactobacillus animalis^ 
Lactobacillus gasseriy Lactobacillus johnsonii, Lactobacillus murinus, Lactobacillus 
sakeiy Lactobacillus curvatus^ Lactobacillus delbrueckii subsp bulgaricus, 
Lactobacillus delbrueckii subsp lactis, Lactobacillus plantarum, Lactobacillus 
pentosus, Lactobacillus casei, Lactobacillus zeae, Lactobacillus paracasein 
Lactobacillus rhamnosus or a bacteria from Lactococcus) but in modified fomi does 
have an S-layer (or at least carries an (e.g. modified) S-protein at its surface). The 
S-layer may thus comprise a naturally occxirring surface layer protein, albeit one that 
is heterologous to the bacteria. For example, the bacteria may be L. casei, which 
does not possess an S-layer, but which has beai modified to express surface layer 
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proteins firom another bacteria, such as from a lactic acid bacteria or Lactobacillus, 
for example L. acidophilus. 

A third aspect relates to an L. acidophilus cell which expresses (e.g. on the 
surface of the cell) a heterologous surface layer protein. In this case, X. acidophilus 
does naturally possess an S-layer, and so the S-protein may be a modified one, or one 
from a (bacterial) species other than L. acidophilus, or both. 

The surface layer protein may thus be a naturally occurring or wild-type 
protein, or it may be a modified surface layer protein for example one of the present 
invention, as described in the first aspect, or a mixture of both. Preferably the 
S-layer protein is antigenic and/or induces an immxme response. The protein is 
suitably not apart of a (e.g. collagen) binding surface layer protein, for example of 
L. crispatus. 

Preferably the number of S-layer proteins is sufficient to form an S-layer, and 
preferably an S-layer that covers the entire cell. Hence the number of S-proteins at 
the cell surface may be at least 1, 2 or 5x10^ S-layer proteins (or monomers), 
preferably at least 10^, optimally at least 5x10^ monomers. 

Preferably the S-layer protein, even if modified, retains the cell wall anchor 
of tiie unmodified or wild-type protein. However, it may have a heterologous anchor 
or binding region or domain, and so may be a fusion protein. This may attach the 
protein to the S-layer, cell wall, Upid bilayer, cytoplasmic membrane, peptidoglycan 
or to a cell wall component. Whatever form of linkage, the protein is preferably 
located at or in the surface of the host. 

Four different types of cell wall binding domains are envisaged. The first 
may bind (e.g. covalently) to the peptidoglycan part of the cell wall. Suitable anchor 
may be from a proteinase, such as PrflP (Novarre, 1994). Secondly, the binding may 
be (e.g. covalently) to tiie lipid bilayer, for example through N-acyl diglyceride 
modification of a cysteine residue (for example located immediately C-temiinal to 
the signal sequence cleavage site (Pugsley, 1993). Thirdly, fliere may be a non- 
covalent linkage to the cytoplasmic membrane (for example, Lysteria monocytogenes 
actA , Cocks et al., 1992). A fourth possibility is an anionic interaction, such as non- 
covalent one, which may be to a cell wall component. This may thus be a different 
method from linkage to a peptidoglycan or membrane. 
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For the first type of linkage, the fusion protein may comprise a sorting signal 
(such as from a proteinase (PrtP) or L. casei). Preferably the bacteria heterologously 
expresses aa S-layer protein encoding gene to produce the expressed protein. 

A fourth aspect of the invention relates to an L. casei bacterial cell expressing 
a (bacterial) surface layer protein that is not from X. crispatus and/or is not a collagen 
binding protein. The S-layer protein may be one that does not bind collagen, for 
example it is not a collagen binding S-layer protein. Preferably the S-layer protein is 
antigenic, or capable of inducing an immune response- Preferably, the i. casei cell 
e>q)resses an S-layer protein that is naturally from, or originates from, 
L. acidophilus. However, the S-layer protein may be derived from a wild-type or 
naturally occurring i. acidophilus protein, and for example it may be an S-layer 
protein modified in accordance with the first aspect. The S-protein may be expressed 
on the surface of the cell. This may be achieved by providing the S-protein with a 
cell wall anchor that allows it to bind to the cell-surface of L. casein for example an 
anchor region from an L casei surface layer protein. This may be from a proteinase 
(such as PrtP). 

A fifth aspect of the invention relates to a (modified) bacteria expressing 
only, or homogenously, a heterologous or modified bacterial surface layer (S-layer) 
protein. The protein may be heterologous in the sense that the bacteria does not 
normally or in xmmodified form does not express that protein. The heterologoxis or 
modified protein may therefore replace the natural or wild-type protein, preferably 
completely, or in its entirety. Thus, the bacteria may express only the heterologous 
or modified S-layer protein, but not any of tiie wild-type S-layer protein, at least in 
the S-layer. (Previous aspects included, where appropriate, cells that expressed both 
the wild-type S-protein as well as a modified S-protein). The bacteria may be 
another than from Caulobacter. 

The fifth aspect therefore includes bacteria having a genome which contains 
or incorporates a polynucleotide encoding a heterologous (to the bacteria) or non- 
wild-type or modified S-layer protein. That is to say, such a polynucleotide may be 
integrated into the genome. Preferably, the gene or polynucleotide encoding the 
wild-type or natural S-layer protein has been silenced, switched off, replaced or 
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otherwise caused to be non-expressed. The wild-type may thus be exchanged for a 
mutant (allele), in effect in vivo. 

Pref^ably the surface layer forms an array, and may cover the entire cell. It 
may thus form an outermost envelope. The surface layer may be a crystalline 
monolayer, or comprise (e.g. self) assembled S-layer proteins. These may be in the 
form of monomeric units. The S-layer may therefore comprise an array of single 
protein monomers (the S-protems). 

In all aspects, suitably the S-layer protein, or bacterial cell, is from a lactic 
acid bacteria, or from the Bacillus/Clostridium group of bacterial (e.g. low GC, Gram 
positive bacteria). Suitably the protein or cell will be from the family 
Lactobacillacede, such as genus Lactobacillus. 

Bacterial Cells of the Invention 

The cell may be a Gram positive bacteria, for example from the AT rich 
(rather than GC rich) group (also called the low GC class). Suitably the bacteria is 
from the Actinomycetes class (which includes Camobacterium, Lactobacillus ^ 
Lactococcus, Leuconstoc, Pediococcus and Streptococcus. Preferably it is non- 
aquatic. It may be a Lactobacillus cell, or a lactic acid bacteria (LAB). It is 
preferably Z. plantarum (such as L. plantarum 80 or 256), L. casei, L. brevis, L. 
acidophilus, L. crispatus, L, helveticus, L amylovorus, L. gallinarum otL. lactis. 
Preferably the host cell is one that can express the desired modified protein, which 
may be antigenic, for example using recombinant techniques. The host cell may 
express only the modified S-layer protein (and no wild-type S-laya: protem, as in the 
third aspect above) or it may express both (the modified and xmmodified) S-layer 
proteins. 

A Lactobacillus plantarum strain can readily be determined using known 
parameters (e.g. Bergeys Manual of Determinative Bacteriology and Vescovo et aly 
Ann Microbiol. Enzymol. 43 261-284 (1993)). The skilled person can therefore 
readily determine whether a lactic acid bacterium is a Lactobacillus plantarum. 
Numerous Lactobacillus plantarum strains have been deposited at various institutes 
and are readily available. 
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The bacterial (host) cell should preferably have GRAS (generally regarded as 
safe) status, and more preferably be food grade. This may thus exclude soil bacteria, 
or Bacillus (e.g. food spoilage organisms, sach as Bacillus terreus or sphaericus). 
Such organisms are not suitable for adtninistration to humans (e.g. as a vaccine). 

Also, the bacterial host used should allow - upon transformation with an 
appropriate construct encoding an antigen- expression of the modified protein on the 
surface. 

Preferably the level of expression of the protein, e.g. intracellularly, secreted 
and/or exposed on the surface (such as determined by SDS-polyacrylamide gel 
electrophoresis or FACS), should be at least 0.1, 0.5, 1, 3 or 5, 7, 10 or 15% (e.g. 8, 9 
or 10 to 12, 13 or 14%) of the total cell protein. 

Also, the bacterial host is preferably capable of settling or persisting in and/or 
colonizing at least part of the gastrointestinal tract, such as the mouth, the throat, the 
larynx, the gut, the small intestine, flie large intestine, the ileum and/or the colon, 
vagina, or a combination thereof. Preferably the bacterial host is such that it mainly 
settles in the intestine, more preferably in the ^all intestine or caecum. 

The bacteria preferably exhibits a persistence in the individual to be 
immunized (upon oral administration and as determined by the presence of the strain 
in the faeces) of at least 5 days, preferably at least 9 days, and suitably more than 15 
or even 20 days. Longer persistence may not be reqxiired if an administration 
regimen comprising the use of one or more booster immunisations as described 



The L. plantarum strain may be LMG 921 1 (NCIMB 8826 as described by 
Mercenier above), DSM 4229 L. casei 393, and/or Z. plantarum 80. Preferably 
however the bacterial host is one of the following L. plantarum strains: 256, LMG 
1284, LMG 6907, LMG 8155, LMG 9205, LMG 9206, LMG 9208, LMG 9209, 
LMG 9210, LMG 9212, LMG 1 1405, LMG 1 1460, LMG 8095, LMG 8027, LMG 
12167, LMG 13556, LMG 17552, LMG 18021, LMG 18023, LMG 18024, LMG 
18027, LMG 18095; 386, 299 , 105 or 275 (see Molin et al., 1993. J. Appl. 
BacterioL 74:314), 299v (see WO 96/29083); So5,36^ 95, 120 or 44 (see Johannson 
et al 1995, Int. J. Syst. Bacterid, Vol 45(4):670-675), 79, 107, 98, 53, 97, 101 or 125 
(see Johansson et al 1995 Int. J. Food. Micro. 25:159), CH, ATCC 8041, ATCC 



below. 
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10012, ATCC 10776, WCFS, DF66 Hla, DF66spez. -IVa, and/or the X. plantarum 
strains available from the Japanese Collection of Micro-organisms under the 
accession numbers: 8341, 8342, 8343, 8344, 8345, 8346, 8347 and/or 8348. 

Preferably the (e.g. Lactobacillus) bacterixmi employed is foreign to the 
individual (human or animal) to which it is to be administered, for example 
vaccinated. The bacteria of the invention are modified or recombinant, and it is so 
the wild-type or naturally occxjrring bacteria, that is to say the non-recombinant or 
unmodified bacteria, that is suitably foreign. Thus by foreign it is intended to refer to 
a (e.g. Lactobacillus) strain not normally present in the individual. In other words, it 
is preferred to use an (e.g. Lactobacillus) strain that is non-himian, for example it is 
not found or present in humans. Preferably the strain will be one not found in 
humans or animals, for example a strain that does not exist in the gut (G.I. tract) 
and/or vagina. Such strains will provoke a greater or larger inmiune response. 
Although some prior workers have used human-derived L. plantarum strains (such as 
NCIMB 8826, a human saliva isolate) it is preferred to use a strain not present in or 
originating firom humans. 

The present invention therefore preferably uses a bacterial strain that is not 
foimd in the mucosa (non-mucosal) or G.L tract of individual, for example it is not 
endogenous (to humans or a species of animal to be vaccinated). Indeed, the most 
preferred strains (e.g. L. plantarum 256) are found in silage. Such strains are clearly 
foreign to humans, and so provide an increased immune response. Nevertheless, 
preferably the strain may be capable of adhering to the intestinal mucosa. 

The strain may also be of non-foodstufif origin, for example not foimd in 
(human or animal) foods. This may thus exclude some L. casei strains, e.g. 393. 
Strain 393 is found in cheese, and so preferably strains found in dairy or fermCTitation 
products are ^eluded. This means that the individual has not (or may not) have had 
contact (e.g. ingested) the strain, and so using strains not found in foods is more 
likely to provoke an immune response. This may explain the long persistence times 
found with certain Lactobacilli. It may be that those bacteria are cleared more 
slowly as they are not recognised by the immune system. Suitably the strain is thus 
of animal origin, for example firom an animal feedstuff (e.g. silage). Strains that are 
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generally more suitable are commensal (in the gut), rather than dietary (e.g. dairy 
origin). 

Preferably the bacterial (e.g. Lactobacillus) strain is viable (or alive) and 
intact. Suitably it will be able to persist (in the mucosa) of the individual for at least 
7 days. This can easily be tested using procedures known in the art (for example, 
testing for the existence of the organism in faeces). 

Thus the cell may be a non-human and/or non-human food Lactobacillus 
bacterium, such as L, plantarum, which has been modified to express the modified 
protein intracellularly or on the cell surface. This bacterium is preferably able to 
elicit an immune response in an individual, to whom the bacterium is administered. 
Thus the naturally occurring or unmodified L. plantarum is preferably foreign to that 
individual, for example it is not endogenous to humans or the animal to which it is to 
be administered. Preferably the chosen X. plantarum strain will not be present in the 
G.I. tract or mucosa of humans or that species of animal. 

Other suitable strains may be selected by the skilled person on the basis of 
one or more of the following properties or factors: 

- stabiUty of the construct encoding the antigen in the bacterial host selected; level 
of expression of antigen in the bacterial host selected; regulation of expression of 
antigen in the bacterial host selected; site of expression of antigen in the bacterial 
host selected; and/or stability of antigen produced; 

- the biochemical properties of the strain used, for example its sugar fermentation 
profile (API), cell wall composition, structure of LTA, structure of 
peptidoglycan, 16S RNA sequence, acid resistance, bile acid resistance, 
agglutination properties, adjuvanticity, immune modulating properties, in vitro 
adherence properties, maimose-specific adherence, presence of proteinaceous 
adherence factors, presence of mapA-like adherence factors and/or presence of 
large proteinaceous adherence fectors with repeated amino acid sequences; 
and/or 

- the interaction of the bacterial host with cells of the individual to which the host 
to be administCTed (i.e. as part of a vaccine according to the invention) including 
but not limited to its persistence , viability, in vivo expression of antigen and/or 
tissue-specific persistence. 
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In at least some, preferably most, and more preferably essentially all of these 
properties, Hie strain used should be essentially (at least) equivalent to the strains 
mentioned above, and more preferably equivalent to L. plantarum 256, e.g. as 
determined on the basis of tests/assays for these properties known per se in the art. 
After a suitable host has been selected, it may be transformed with a genetic 
construct as described herein, after which its suitabiUty as an oral vaccine may be 
tested. It is envisaged that on the basis of the description herein, and optionally after 
carrying out - for the purposes of confirmation- the tests described herein, the skilled 
person will be able to identify other L. plantarum strains suitable for use as or in 
vaccines of the invention. 

Preferably the X. plantarum is in the group (or cluster) 1, (which includes 
strains 101, 97, 53, 256, ATCC 14917, 36^ 95, 98, 299, 299v, 107, 105, 79, 275, 
386, So5 and ATCC 8014), suitably subgroup (or subcluster) lb (which includes 
256, ATCC 14917, 36^ 95 and 98), rather than subgroup (or subcluster) Ic (which 
covers human originating strains). Particularly preferred is the L. plantarum strain 
256. 

To classify L, plantarum strains a method can be used in which the genomes 
of the Lactobacillus strains are characterized by performing a restriction 
endonuclease analysis (REA) of chromosomal DNA followed by agarose gel 
electrophoretic separation of the fragments (Stahl et al. Int. J. Syst. Bacteriol. 40: 
189-93 1990; Johansson et ah Int. J. Syst, Bacteriol. 45, 670-675 1995). The power 
of the REA technique is that it provides a unique fingerprint of the chromosomal 
DNA of each strain. When REA was applied to classify L. plantaum strains and the 
results were analysed by cluster analysis it was shown that this species is comprised 
of two clusters. Cluster 1 comprises strains originating firom both human intestine 
and fermented food and feed, whereas cluster 2 comprises only two strains from 
Nigerian ogi. Cluster 1 contains 3 subgroups, la, lb and Ic. All human Z. 
plantarum strains were found in group lb. L. plantarum strains of non-human origin 
were foimd in all 3 sub-groups. 

A further distinction between L. plantarum strains from the three subgroups 
of cluster I was described by Adlerberth et al (Appl. Environm. Microbiol. 62, 
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2244-2251 1996) who observed that strains from subgroup Ic adhered considerably 
stronger to human adenocarcinoma cells in vitro compared to strains from subgroups 
la and lb. Strains from subgroups la, lb and Ic can be further discriminated on the 
basis of their hemagglutination patterns. Strains from subgroup Ic strongly 
agglutinated hinnan, guinea pig, chicken, cat dog, mouse, rabbit, rat, horse and pig 
erythrocyte, while those from la and lb display strongly reduced or no agglutination 
characteristics. 

A combination of one or more of the strains mentioned above may also be 

used. 

Preferably the host cell has only a limited persistence in the gastrointestinal 
tract, such as from 3 to 10 days, preferably from 4 to 8 days, similarly from 5 to 7 
days. 

Brief description of the Figures 

Figure 1. (A) Linker insertion mutagenesis and E. coli expression vector 
constructions. Sites chosen for linker insertion are indicated by solid vertical arrows. 
For details on linker insertion positions the reader is referred to Table 1. (B) 
Lactobacillus vector constructions. Promoter sequences are indicated by an arrow 
pointing to the right and a stemloop indicates Ihe presence of a transcription 
terminator sequence. Black barr indicates the slpA sequence encoding mature Sa- 
protein and grey bars the signal sequence. Restriction sites are abbreviated as 
follows: B, Bamm; K, Kpnl; P, Pstl; Bs, BstEIl; H, Hindm, Nc, Ncol; N, NotI; X, 
Xhol, S, Saa; Bg, BgUI; E, EcoBl. 

Figure 2. SDS-PAGE analysis of SA-protein mutants purified from E. coli. Total (T), 
pellet (P) and supernatant (S) fractions obtained by dialysis and centrifiigation. The 
numbers above each panel represent mutant numbers. 

Figure 3. SDS-PAGE and westem analysis of surface protein extracts obtained from 
wild type L. acidophilus (lane 1) and transformants producing mutant SA-proteins 
Sa9c, Sauc, Sai2c and Saisc G^nes 2 to 5). The top panel shows a coomassie stained 
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SDS-PAA gel, the middle panel a western analysis with anti-SA-protein and the 
lower panel a western analysis with anti-c-myc. The arrow indicates SA-protein (43 
kDa) and the Mr's of the reference proteins are given in the left margm. 

Figure 4. Detection of c-myc at the cell surface of acidophilus transformants using 
flow cytometry. Cells were coated with mouse monoclonal anti-c-myc antibodies 
followed by fluorescent labelling with goat-anti-mouse FITC-conjugated antibodies. 
Fluorograms of £. acidophilus producing Sa9c, Saiic, Sai2c and Saisc are indicated 
wititi 9c, 1 Ic 12c and 13c, respectively. The inset shows the negative control (wild 
type L. acidophilus). 

Figure 5. Western analysis of SAN and four N- or C-tenninally truncated SAN 
peptides. Peptides were separated by SDS-PAGE and detected with anti-his 
antibody. OUgomers of SAN (®) and duners of SAN2, SAN4, SAN6 and SAN7 
peptides are indicated with arrows (lane 1 to 5, respectively). Mr's or reference 
proteins are indicated in the right margin. 

Figure 6. Hypothetical structural organisation of SA-protein monomer and the L, 
acidophilus S-layer. (A) Contact points of one repeating unit with surrounding units 
are indicated with arrows and the putative boundary between two morphological 
subdomains (black and black shading, respectively) with a dashed line. (B) Overview 
of repeating units forming the S-layer structure. Repeating units as well as 
subdomains are indicated as in (A). (C) Schematic representation of one SA-protein 
monomer showing the SAN and SAC domains and the two putative SAN 
subdomains. (D) Schematic representation of a cross-section of theZ. acidophilus 
cell wall and S-layer. 

Brief description of the Sequences 

SEQ ID NO. 1 is a DNA sequence encoding the (prior art) L. acidophilus 
(mature) S-layer protein (slpA gene) which was modified to create the surface layer 
proteins of the invention; 
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SEQ ID NO. 2 is the amino acid sequence of SEQ ID NO. 1; 

SEQ ID NOS. 3 to 17 are amino acid sequences showing the position of (and 
including) the heterologous polypeptide (sequence) used to create the modified 
proteins of the invention; and 

SEQ ID NOS. 18 to 33 are primers used to construct the proteins of the 
invention. 

Mode of vaccine delivery 

Preferably this is a mucosal or oral or nasal vaccine. This may contain one or 
more of the bacterial cells of any of the aspects of the invention previously described. 
The vaccine may comprise lactose, such as to assist in the expression of the modified 
protein. This can mean any vaccine suited, adapted, intended and/or fomiulated for 
mucosal delivery. 

Mucosal delivery means any route of administration to the body of a human 
or animal for which it is not required to penetrate or puncture the skin (e.g. as with 
intravenous, intramuscular, subcutaneous or intraperitoneal administration). Usually, 
this means that the vaccine is administered to the body via one of the body cavities, 
such that it comes into contact with the mucosa. Hence mucosal administration in 
particular refers to nasal, oral and/or vaginal administration. 

Oral delivCTy (of a vaccine) means any route of delivery to the body of a 
human or animal into, by which the vaccine can be presented to, the gastrointestinal 
(G.I.) tract or any part thereof Usually, this will involve administration into or via 
the mouth into the G.I. tract. This also includes administration directiy into the G J. 
tract or into any part thereof, and including into the stomach, for instance using a 
tube or catheter. 

An oral vaccine is one that is suited, adapted, intended and/or formulated for 
oral delivery as defined above. 
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Immune Response 

Suitably the vaccine will elicit an (immune) response. A response (e.g. an 
antibody response or immune response) is deemed significant if it leads to a 
detectable change or response in a human or animal, and in particular to a detectable 
immunological change or response, such as the production of antibodies, cytokines, 
lymphokines, etc. Tests for determining whether a response is significant are known 
in the art and include, but are not limited to, titration of antibody levels in biological 
samples using ELIS A techniques, ELISPOT techniques and in vitro lymphocyte 
stimulation assays. Such techniques are usually carried out on a biological sample, 
such as a biological fluid or cell sample, obtained from the human or animal. 

A significant response may be, but is not necessarily, also a protective 
response as defined below. A response (e.g. an immunological response against a 
pathogCTL or an antigen) is deemed significant when it is capable of protecting the 
human or animal having the response against flie patbogen and/or against a pathogen 
associated with the antigen. 

An antigen is deemed exposed on a bacterial host (also referred to as 
(surface) exposition of the antigen) when it is present, forms part of, is attached to, 
and/or is otherwise associated with or detectable on (e.g. using a suitable 
immunological detection technique such as FACS or immunofluorescent 
microscopy) the surface of the bacterial host (e.g. the bacterial cell wall or envelope) 

Preferably, exposed means that the bacterium, when presented to a cell of a 
human or animal that is capable of mediating an immune response (such as the cells 
of the G.I. tract mentioned below) for a sufficient time and in a sufficient amount, is 
C£q)able of eliciting a sufficient immune response against the antigen. 



A preferred embodiment of the invention is a vaccine wherein the bacteria 
comprises an expression vector, for example capable of expressing the modified 
protein (e.g. an antigen), suitably on the cell surface (or intracellularly). The 
modified protein may thus be expressed internally or externally. Preferably the 



Vaccines 
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protein is exposed on the cell surface (e.g. under conditions present in the 
gastrointestinal tract). 

The bacteria may egress a modified protein (e.g. the heterologous 
polypeptide) specific for mucosa colonising pathogens or pathogens entering the 
body via the mucosa, specifically via the oral route. The heterologous polypeptide 
(e.g. antigen) may thus be specific for a gastrointestinal tract colonising pathogen. A 
heterologous polypeptide specific for tetanus (Clostridium tetanus)^ such as TTFC, is 
a particularly suitable candidate. 

Preferably the vaccines are formulated such that a single dose is sufficient. 
However embodiments where multiple applications over a period of time, e.g. with a 
view to the persistence of the bacterial host in the G.I. tract, are also envisaged. The 
provision of booster vaccinations is also envisaged with the vaccine formulations 
according to the invention. 

A preferred administration regimen comprises one or more "initial" doses or 
administrations on any of days 1 to 4, followed by one or more booster 
administrations on any of days 14 to 21, and optionally one or more fiirther booster 
administrations on any of days 28 to 25. A single initial administration, followed by 
a single booster administration, within this time period, will generally be sufficient 
As high a degree of expression as possible without damaging the viability of 
the cell or the host to be vaccinated is envisaged. With higher expression, less 
frequent and lower doses may be required for immunisation purposes. Naturally the 
dosage regime will not only depend on amount of antigen but also on antigen type 
and the presence or absence of oth^ immunogenicity stimulating factors in the 
vaccine. 

One can use homologous expression and/or secretion signals in the expression 
vector(s) present in the recombinant bacteria in the vaccine: this may give a high 
degree of expression. The expression regulating signals used in the constracts in the 
Examples, as well as other expression signals, are suitable. 
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Heterologous polypeptide 

Such a polypeptide can be a binding or target protein, a linker, or an antigen 
or antibody or part thereof. It is preferably an antigen, and so may be able to elicit or 
stimulate an immxme response. It can be any antigen against which an immune 
response, more specifically a significant immune response and/or a protective 
immxme response, can be ehcited in an animal (preferably a manunal such as a 
human). The polypeptide (or antigen) may be associated with a pathogen, disease 
state and/or disorder of the human or animal to which the vaccine is to be 
administered. Preferably the antigen will be able to interact with one or more (e.g. 
specific) receptors, for example present on lymphocytes or in antibodies released 
from them. 

The polypeptide can thus comprise an immunogen. Since the polypeptide can 
be one that can elicit an immune response, this may exclude for example a protein 
that is present aheady or originates firom in the individual. Antigens are thus 
preferably foreign to that individual. 

Any antigen, antigenic component or epitope known per se that can be 
expressed in the host can be used as the polypeptide. Usually this will be a peptide, a 
protein, or an antigenic part or firagment thereof, such as an epitope. As such, it may 
either be a native antigenic peptide or protein (or part, firagment or epitope thereof) or 
an antigenic analog or mutant thereof, for instance obtained synthetically or using 
recombinant DNA techniques. 

Recombmant bacteria and/or bacterial strains, as well as vaccines based 
thereon, may be provided that can be used to illicit a significant immune response, 
and preferably a protective immune response, against various antigens. Suitable 
(heterologous) polyp^tides thus include: 

- antigens or antigenic determinants firom a virus, bacteriimi, fimgus, yeast or 
parasite; 

- allergens; 

- viral and/or bacterial antigens including those fi*om (e.g. the gp 160 envelope 
protem of) the HIV virus, such as mV-l or HIV-2, a surface glycoprotein (of a 
Leishmania parasite), Shiga-like toxin. Shigella lipopolysaccharide antigen. 
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Escherichia coli heat labile toxin B subunit or K88 or fimbrial antigen^ a CFA 
antigea (of an enterotoxigenic Escherichia coli strain), anthrax toxin, pertussis 
toxin or toxin firom Bordetella pertussis (e.g. P69), tetanus toxin (or a fragment 
thereof, e.g. TTFC); 
— antigens firom such pathogens as herpes virus, rubella virus, influenza virus, 
mumps virus, measles virus, polio(myeUtis) virus (tj^e 1, 2 or 3), rotavims, 
FMDV, respiratory syncytial virus, Campylobacter species. Chlamydial 
organisms, species of the genus Cryptosporidium, cytomegalovirus, human 
immunodeficiency virus, Actinomyces species. Entamoeba histolytica, 
arenavimses, arboviruses, Clostridium botulinum, species of the genus Candida, 
Vibrio cholera, Cfyptococcus neoformans, EHEC strains of Kcoli 0157:H7, 
026:H1 1, Ol 1 1 :H8 and O104:H21, ETEC strains ofE. coli strains of Kcoli 
shown to possess enteroinvasiveness (EIEC), EPEC strains of E.coli, EAggEC 
strains of Kcoli,, DAEC strains of E.coli, filoviridae, parvoviras, Filarioidea, 
Staphylococcus aureus, species of the genus Clostridium perfringens, C. tetani, 
V, cholera, N. meningitides, C. trachomatis, Helicobacter pylori, Caliciviruses, 
Giacardia lamblia, Neisseria gonorrhoeae, hantaviruses, hepatitis vimses types 
A, B, C, D, E, Legionellae strams, Mycobacterium leprae. Listeria 
monocytogenes, species of the genus Clostridium perfringens, Borrelia 
burgdorferi, Pseudomonas pseudomallei, Epstein Barr virus. Onchocerca 
volvulus. Poxviruses, Bordetella pertussis, Yersinia pestis, Coxiella bumetti, 
rabies virus, Treponema pallidium, Mycobacterium tuberculosis. Salmonella 
typhi, eukaryotic parasite causing malaria, Pneumocystis pneumonia, as well as 
agents causing toxoplasmosis. 

Preferably the polypeptide does not comprise a pilus type epitope (e.g. a pilin 
or adhesintope, such as fix)m P. aeruginosa). The polypeptide may be a combination 
of two or more of the above linked (e.g. covalently) or as a fiision protein. 

Preferably the allergen is a human allergen, or an allergen provoking an 
allergic reaction in the type or species of individual to whom the composition is to be 
administered. It may be a house or insect all^gen, such as firom dust mite, e.g. 
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Apart from native antigens and antigenic components (including aatigenic 
parts, fragments or epitopes thereof), antigenic mutants or analogues thereof - 
obtained synthetically or via recombinant DNA techniques - may be used Also, in 
the vaccines of the invention, a combination of two or more such antigens may be 
present or expressed. These antigens may be expressed by a single (type or strain) of 
bacterial host, or by several different (types or strains) of bacterial host. 

Of the above, antigens against and/or specific for rotavirus, respiratory 
syncytial virus, Mycobacterium tuberculosis^ human immunodeficiency virus, E,coH^ 
Vibrio cholera, streptococci and chlamydia are especially preferred for use as an 
antigen in the vaccines of the invention. 

The polypeptide is preferably such that, upon expression, it still allows - at 
least to some extent, which may be reduced compared to the native strain - the 
recombinant bacterial host to settle in and/or colonize (part of) the gastrointestinal 
tract upon administration, and to persist there. This may be for a time sufficient to 
provide a significant immune response against the antigen and/or the pathogen 
associated with it. 

Preferably the polypeptide is at least 5, 10, 15, 20, 30 or 40 amino acids in 
lengtii. The polypeptide may be no more than 50, 70, 100 or 200 amino acids long. 

Fusion proteins 

The heterologous polypeptide need not be a single protein, indeed it may be a 
combination of two or more polypeptides. However, the polypeptides(and hence 
also tiie modified protein) is suitably heterologous to the cell in which the (S-layer) 
protein is expressed. The heterologous polypeptide may therefore be an antigm or 
toxin linked to a viral protein, for example a tetanus protein (or antigen) covalently 
linked or fused to a viral protein. Preferably the tetanus protein is TTFC. The viral 
protein is preferably from a rotavirus, such as of the group Reoviridae. It may be 
from a rotavirus of group A, B or C. The viral protein may be a capsid protein, for 
example an inner capsid protein (such a VP6) or an outer capsid protein (such as a 
VP4 or VP7). The protein may be a viral capsid protein and/or a viral surface 
protein. It may be a glycoprotein. 
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Preferably the viral protein is at least 20, 25, 30, 40 to 50 amino acids in 
length. Preferably it is not a transmembrane protein, or a protein from HIV (such a 
GP41). 

The modified protein may additionally possess an anchor, or an anchor or 
binding regional domain, in order to allow it to be attached or tethered to a cell 
surface. This may be necessary if the bacterial cell does not normally express 
S-layer proteins, for example if it does not possess an S-layer. The anchor may be 
attached to the C-terminus. For example, if one is to obtain expression in L. casei or 
L, plantarum^ which do not possess an S-layer, then one can employ a cell wail 
anchor or binding region or domain (such as described earlier for location at the 
surface of the host), and if necessary a sorting signal, for example fromZ. casei^ such 
as PrfP (Martinez et al, supra). 

Lactobacillus or LAB Expression systems 

Usxxally, the e3q)ression system will comprise a genetic construct comprising 
at least one nucleotide sequence encoding the (modified) polypeptide, or antigen(ic 
component), preferably operably linked to a promoter capable of directing expression 
of the sequence in the bacterial host. Suitably the polypeptide to be expressed can be 
encoded by a nucleic acid sequence that is adapted to the preferred codon usage of 
the bacterial host . The construct may further contain (all) other suitable element(s), 
including enhancers, transcription initiation sequences, signal sequences, reporter 
genes, transcription termination sequences, etc., operable in the selected bacterial 
host. 

The construct is preferably in a form suitable for transformation of the 
bacterial host and/or in a form that can be stably maintained in the bacterial host, 
such as a vector or plasmid. More preferably, a food grade construct is used. 

A particularly preferred constmct according to the invention comprises the 
multi-copy expression vector described in WO-A-96/32487 or WO-A-01/21200, in 
which the nucleotide sequence encoding the antigen has been incorporated. Such a 
construct is particularly suitable for expression of a desired protein or polypeptide in 
a lactic acid bacterium, in particular in a Lactobacillus, at a high level of expression, 
and also can be used advantageously to direct the expressed product to the surface of 
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fhe bacterial cell. The constructs may be characterised in that the nucleic acid 
sequence encoding the antigen is preceded by a 5' non-translated nucleic acid 
sequence comprising at least the minimal sequence required for ribosome recognition 
and RNA stabilisation. This can be followed by a translation initiation codon which 
may be (inmiediately) followed by a fragment of at least 5 codons of the 5' terminal 
part of the translated nucleic acid sequence of a gene of a lactic acid bacterium or a 
structural or functional equivalent of the fragment. The fragment may also be 
controlled by the promoter. The contents of these two WO publications, including 
the dififering embodiments disclosed therein, and all other documents mentioned in 
this specification, are incorporated herein by reference. 

The vaccines of the invention are preferably oral vaccines, that is to say they 
are adapted for oral administration. Such oral vaccine compositions will usually be 
alkaline, since usually alkali is required in order to neutralise acid in tiie stomach, 
and allow the bacteria (or at least most of them) to pass through the stomach into the 
intestine alive. It is preferred that most of the bacteria adnoinistered will survive the 
stomach, and pass into the intestine. Increased immune responses can be achievable 
when the bacteria are alive, that is to say viable, rather than dead. This is because 
they can continue to express the modified protein in vivo. 

If the heterologous polypeptide is inserted at a location where it is not 
exposed, or at least partly present in the S-layer, then this may reduce or avoid 
unwanted degradation or proteolytic processing of the polypeptide (in the GI tract). 
Not all pharmaceutical formulations will be alkaline, and therefore those that are not 
alkaline (for example nasal formulations) may not be suitable for oral administration. 

The sequence encoding the modified protein and/or heterologous polypeptide 
(e.g. antigen) can be obtained firom any natural source and/or can be prepared 
synthetically using well known DNA synthesis techniques. The sequence encoding 
the antigen can then (for instance) be incorporated in a suitable expression vector to 
provide a genetic construct of the invention, which is then used to transform the 
intended bacterial host strain. 

The recombinant bacterial host thus obtained can then be cultured, upon 
which the harvested cells can be used to formtilate the vaccine, optionally after 
further purification and/or processing steps, such as freeze-drying to form a powder. 
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The techniques required to create the genetic constructs containing the 
antigen-encoding sequence and for transforming, culturing and harvesting the 
bacterial hosts are well known in the art. For instance they are described in standard 
handbooks, including Sambrook et al, and F. Ausubel et al^ eds., "Current protocols 
in molecular biology". Green Publishing and Wiley Interscience, New York (1987). 

The vaccine comprising the bacterial host can be formulated in a known 
manner, such as for the formulation of vaccines and/or for the formulation of 
preparations of live bacteria for oral administration to an animal or human. Reference 
can be made to preparations for the administration of probiotics, e.g. for the 
treatment of gastrointestinal disorders. 

The vaccine according to the invention can be in a form suitable for oral 
administration, which may be a solid, semi-solid or liquid form, including but not 
limited to solutions and/or suspensions of the bacteria, which are usually preferred. 

The vaccine preparation may also be in the form of a powder, such as a freeze 
dried powder that can be reconstituted before use, e.g. using a suitable liquid. It may 
be in the form of a solid or liquid preparation that is (to be) mixed with solid, semi- 
soUd or Uquid food prior to administration. It may also be in the form of a fermented 
product. 

Besides the bacteria, the vaccine may contain one or more pharmaceutically 
acceptable carriers or excipients, such as water. The vaccine may also contain one or 
more adjuvants, including immime adjuvants, suitable for oral administration. These 
are compatible with the bacterial host and suitably do not interfere (too much) with 
its desired immunogenic properties. According to one embodiment, the adjuvant may 
be a lactic acid bacterium, such as the bacterial host strain itself one of the other L. 
plantarum strains mentioned above, another Lactobacillus species, or even a 
Lactococcus, Bifidobacterium or Propionibacterium species suitable for oral 
adnunistration to humans or animals. 

Also, the vaccine may contain one or more further therapeutic substances 
and/or one or more substances that can facilitate and/or enhance the colonization of 
(part of) the G.I. tract by the bacteria, and/or the growth of the bacteria in ttie G.I. 
tract. The prq)aration may also be in a form suitable for (direct) administration into 
the stomach or gut, for instance via a tube or catheter. Preferably upon oral 
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administration the bacterial host settles in, and may thereupon colonise, the 
gastrointestinal tract, or at least a part thereof, such as the mouth, the gut, the small 
intestine (e.g. duodenum, jejunum or ileimi), the large intestine (or part thereof such 
as the caecum) or colon, and preferably either the small intestine or the caecum. 

Immunogenicity 

The modijaed protein, and so, if desired, the heterologous polypeptide (e.g. 
antigen) e^qpressed by the bacterial host thus can come into contact with the mucosal 
layer, the lining and/or the wall of the G.I. tract (collectively referred to hereinbelow 
as "wall of the GJ. tract"), and more specifically with cells within this wall. This can 
mediate an immune response against the antigen(s) thus presented, such as antigen- 
presenting cells (for example macrophages, dendritic cells and/or B-lymphocytes). 
This immunological response by the cells within the wall of the G.I. tract may by 
itself already constitute a significant immune response as defined herein, and/or it 
may trigger further inraiunological reactions/responses in the body of the human or 
animal to which the vaccine has been administered, which again may be a significant 
response and/or may be a protective response as defined herein. However, the 
invention is not limited to any specific mechanism via which the recombinant 
bacterial host elicits any immune response(s). For instance, the immune response 
elicited by an antigen as expressed by the bacterial host may provide a stronger or 
enhanced immune response compared to the response that would be elicited by the 
antigen as such, e.g. as a (free) soluble protein. 

The bacterial host - or any part or fragment thereof and/or any fiirfher 
compound(s) produced by it - may interact with the wall of the G.I. tract and/or with 
specific cells therein such as the cells which mediate and/or are involved in the 
immune response. This may enable, facilitate or enhance the inmiune response to the 
antigens associated with the bacterial host, such as compared to adnMnistration, 
expression and/or use of the antigen as a free, soluble protein. 

Although the (modified) protein is preferably expressed by the bacterial host 
such that the antigen becomes exposed on the cell surface, it is not excluded that - in 
order to illicit the immime response - the contents of the cells of the bacterial hosts 
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are in situ (i.e. locally at the wall of the GJ. tract) released and/or liberated from the 
bacterial cell, e.g. by a mechanism which makes the walls of the bacterial cell wall 
permeable and/or destroys the cells or the bacterial cell wall. Thus it is possible that 
(in situ at the wall of the G.I. tract) the immxmogenic response may not be (just) 
caused/elicited by the intact bacterial host, but by a part, fragment, fraction or 
compoxmd thereof for example the antigen as such and/or cell fragments or cell 
fractions comprising the antigen. Thus although it is preferred that the vaccines 
comprises intact, viable and/or live bacteria, vaccines that (e.g. also) contain 
fragments, fractions, lysates etc., of or derived from the recombinant bacterial host 
are not excluded. 

Dosages 

The amount of bacteria administered is not critical, but suitably it is sufficient 
for the bacteria to settle into and/or colonize (the desired part of) the G.I. tract, and/or 
to cause a significant immxme response. A suitable amount will be at least 10* cfti, 
preferably 10^ - 10^° cfu per dose. This may allow a sufficient amount of bacteria to 
pass into the intestine, if required. Oral administration of doses less than 10* cfu may 
not always give the desired immunogenicity (at least not in a reliable manner), 
whereas amounts of more than 5x10^^ cfu, if cumbersome to administer orally, are 
less preferred. The above amounts of bacteria (dosages) may correspond, to for 
instance, 10^ to 10* cfu per kg of body weight of the human or animal. The 
concentration of bacteria in the vaccine (or other formulation) may be at least 
5xl0^/ml, such as at least lO^^/ml. The formulation may be administered for only up 
to 2, 3 or 4 days. The bacteria may still be detectable in the individual at least 5 
days, 7 days or 9 days after tiie first or last administration. 

Vaccine formulations 

Preferably the individual to be vaccinated is a human or an aziimal. The 
human can be an infant, immunocompromised person, elderly person or a normally 
healthy infant, child or adult. 
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One advantage of the bacterial hosts used is that they can be capable of 
surviving in/passing through the gut in amounts sufficient to colonize the intestine(s). 
Nevertheless, one can administer the bacteria in or as a coated or encapsulated 



coated composition. Suitable encapsulating compounds include but are not limited to 
chitosan, maltodextrin, lipids and oligo- and polysaccharides. Encapsulation may 
also improve the shelf-Hfe of the vaccine. The vaccine may be adjuvant-free but 
preferably contains one or more adjuvants. 

Strains of oHx&cLactobacillus species may also prove sxiitable for use as a 
bacterial host, for example in vaccines of the invention. These can include strains 
from L. pentosusy L. reuteri, L. animalis (= L. murintis), L.fermentum, L, 
acidophilus, L, crispatus, L. gasseri, L. Johnsonii, L. salivarius, L. brevis, L. 
rhamnosiSj L. helveticus and/or i. paracasein 

The strains usefril in the invention preferably have GRAS status and more 
preferably are food-grade. Also, they are most preferably used in combination with 
the expression vectors of WO-96/32487 mentioned above, or another vector that 
gives a level of expression (e.g. intracellularly and/or exposed on the surface of the 
bacterial host) comparable to this preferred expression vector. 

Furthermore, although the use of strains belonging to the genus Lactobacillus 
are preferred, it is envisaged that suitable strains could possibly also be selected from 
bifidobacteria and the propriobacteria, e.g. from the genus Bifidobacterium and/or 
the genus Propriobacterium. Suitable strains can be selected by the skilled person in 
the same way as L.plantarum and/or Lactobacillus can be selected. 

For instance, a suitable test for detennining/confirming whether a selected 
strain is suitable as a bacterial host according to the invention, is to transform the 
host with a plasmid carrying the replication region of pAMBl, pNZ71, pLP323 under 
the control of promoter sequences such as the regulatable promotes Pamy and Pxyl 
or the constitutive promoters Pldh and Pslp A. Examples of such vectors are the 
TTFC carrying vector pPG421 (for surface anchored/surface exposed expression of 
the TTFC-antigen under control of Pamy) and/or tiie TTFC carrying vector pPG623 
(for intracellular expression of the TTFC-antigen under control of Pxyl), tiien to 
administer the recombinant host thus obtained orally to an animal, preferably a 



preparation, for instance in the form of a delayed release composition or an enteric 
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mammal (e.g. a motise such as a BALB/c and/or C57bl/6 mouse), preferably 
according to the single dose priming and boosting schedule as mentioned in the 
Examples, followed by measuring the end-point titres of IgG in individual sera by 
ELIS A using tetanus toxoid. In such an assay, the selected recombinant host 
preferably provides higher (i.e. at least 1% higher) titres than L,plantarwn NICMB 
8826 and/or L.plantarum 80 when transformed with the same vector and 
administered under the same conditions; and more preferably titres which are at least 
10% higher, even more preferably at least 20% higher. 

Where the law allows, the invention also relates to the administration of the 
bacterium or vaccine to an individual, such as a human or animal (e.g. a mammal), 
where that individual (or subject where appropriate) is in need of bacterium or 
vaccine. The individual may require treatmait or prophylaxis or a particular disease, 
as described above. 

Further Variants and Homologues 

The (modified) proteins of the invention can if necessary be produced by 
synthetic means although usually they will be made recombinantly. They may be 
modified for example by the addition of histidine residues or a T7 tag to assist their 
identification or purification or by the addition of a signal sequence to promote their 
secretion firom a cell. 

Modifications can be made to the original or wild-type sequence, and yet the 
protein may still be an S-layer protein. This may include the deletion, modification 
or addition of single amino acids or groups of amino acids within the protein 
sequence, preferably so that the p^tide maintains the basic biological fimctionality 
of the protein. 

Conservative substitutions may be made, for example according to the following 
Table. Amino acids in the same block in the second column and preferably in flie 
same line in the third column may be substituted for each other. Pref^ably 
substitutions do not affect tiie folding or activity of the polypeptide. 
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Alterations 

Proteins of the inventioii may be chemically altered, e.g. post-traaslationally. 
For example, they may be glycosylated (one or more times, by the same or different 
sugars) or comprise modified amino acid residues. They may also be modified by the 
addition of histidine residues (to assist their purification) or by the addition of a 
signal sequence (to promote insertion into the cell membrane). The protein may 
have one or more (N) anoino- or (C) carboxyl-terminal extensions, such as an amino- 
terminal methionine residue, a small linker peptide of up to about 20-25 residues, or 
a (small) extension that facihtates purification, such as a poly-histiditie or T7 tag, an 
antigenic epitope or a (e.g. maltose) binding domain (e.g. at the C-terminus). These 
extensions may or may not be added via a linker. 

Polynucleotides. 

The present invention provides an (e.g. isolated and/or purified) polynucleotide 
encoding a (modified) protein of the invention. The present invention thus provides 
a polynucleotide encoding the modified protein. Also included is a polynucleotide 
selected firom: 

(a) a polynucleotide encoding the modified protein or the complement thereof; 

(b) a polynucleotide comprising a nucleotide sequence capable of (e.g. 
selectively) hybridising to a nucleotide sequence set out in (a), or a fragment 
thereof; 
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(c) a polynucleotide comprising a nucleotide sequence capable of (e.g. 
selectively) hybridising to the complement of the nucleotide sequence set out in 
(a), or a fragment thereof; and/or 

(d) a polynucleotide comprising a polynucleotide sequence that is degenerate 
as a result of the genetic code to a polynucleotide defined in (a), (b) or (c). 
Polynucleotides in (b) and (c) can contain the change(s) corresponding to the 

modification(s) in the protein. 

Hybridisable sequences 

The term "capable of hybridizing" means that the target polynucleotide of the 
invention can hybridize to the nucleic acid used as a probe (for example the 
nucleotide sequence of the modified protein, or a fragment thereof or the 
complement thereof) at a level significantly above background. The nucleotide 
sequence may be KNA or DNA and thus includes genomic DNA, synthetic DNA or 
cDNA. Preferably the nucleotide sequence is a DNA sequence and most preferably, a 
cDNA sequence. Polynucleotides of the invention can be synthesized according to 
methods well known in the art. 

A polynucleotide of the invention can hybridize to the coding sequence or the 
complement of the coding sequence of SEQ ID No.l at a level significantly above 
background. Background hybridization may occur, for example, because of other 
cDNAs present in a cDNA Ubrary. The signal level (eg. generated by the interaction 
between a polynucleotide of the invention and the coding sequence or complement of 
the coding sequence of SEQ ID No. 1) is typically at least 10 fold, preferably at least 
100 fold, as intense as interactions between other polynucleotides and the coding 
sequence of SEQ ID No. 1. The intensity of interaction may be measured, for 
example, by radiolabelling the probe, e.g. with ^^P. Selective hybridization may 
typically be achieved using conditions of low stringency (0.3M sodium chloride and 
0.03M sodivrai citrate at about 40'*C), medium stringency (for example, 0.3M sodium 
chloride and 0.03M sodium citrate at about 50^C) or high stringency (for example, 
0.3M sodium chloride and 0.03M sodium citrate at about 60*^0). Hybridization may 
be carried out under any suitable conditions known in the art^ and, as a guide, low 
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Stringency can be 2 x SSC at 55^*0, medium stringency can be 0.5 to 1.0 x SSC at 
eO^'C and high stringency can be 0.1 or 0.2 x SSC at 60*^0 or higher (e.g. at 68^C), all 
at0.5%SDS. 

Alterations 

Polynucleotides of the invention may comprise DNA or RNA. They may be 
single or double stranded. They may also be polynucleotides which include within 
them one or more synthetic or modified nucleotides. A number of different types of 
modifications to polynucleotides are known ui the art. These include 
methylphosphonate and phosphorotiiioate backbones and/or addition of acridine or 
polylysine chains at the 3' and/or 5' ends of the molecule. For the purposes of the 
present invention, it is to be understood that the polynucleotides described herein 
may be modified by any method available in the art. 

It is to be understood that skilled persons may, usmg routine techniques, make 
nucleotide substitutions that do not affect the protein sequence encoded by the 
polynucleotides of the invention to reflect tiie codon usage of any particular host 
organism, for example in which the polypq)tides of the invention are to be 
expressed. 

The wild-type protein sequence may be modified by nucleotide substitutions, for 
example firom or up to 1, 2 or 3 to 10, 25, 50 or 100 substitutions. The 
polynucleotide may altematively or additionally be modified by one or more 
insertions and/or deletions and/or by an extension at either or both ends. The 
modified polynucleotide generally encodes a protein which is antigenic or is an 
S-layer protein. Degenerate substitutions may be made and/or substitutions may be 
made which would result in a conservative amino acid substitution when the 
modified sequence is translated, for example as discussed with reference to proteins. 

Recombinant Aspects 

The invention also provides vectors comprising a polynucleotide of the 
invention, including cloning and expression vectors, and methods of growing. 
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transforming or transfecting such vectors in a sxiitable host cell, for example undor 
conditions in which expression of a polypeptide of the invention occurs. Provided 
also are host cells comprising a polynucleotide or vector of the invention wherein flie 
polynucleotide is heterologous to the genome of the host cell. The term 
"heterologous", usually with respect to the host cell, means that the polynucleotide 
does not naturally occur in the genome of the host cell or that the polypeptide is not 
naturally produced by that cell. 

Polynucleotides of the invention can be incorporated into a recombinant 
replicable vector, for example a cloning or expression vector. The vector may be 
used to replicate the nucleic acid in a compatible host cell. Thus in a further 
embodiment, the invention provides a method of making polynucleotides of the 
invention by introducing a polynucleotide of the invention mto a replicable vector, 
introducing the vector into a compatible host cell, and growing the host cell under 
conditions which bring about replication of the vector. The vector may be recovered 
from the host cell. Suitable host cells are described below in connection with 
expression vectors. 

Vectors 

The polynucleotide of the invention may inserted into an expression cassette. 
The vector into which the expression cassette or polynucleotide of the invention is 
inserted may be any vector which may conveniently be subjected to recombinant 
DNA procedures, and the choice of the vector will often depend on the host cell into 
which it is to be introduced. Thixs, the vector may be an autonomously replicating 
vector, i.e. a vector which exists as an extra-chromosomal entity, the repHcation of 
which is independent of chromosomal replication, e.g. a plasmid. Alternatively, the 
vector may be one which, when introduced into a host cell, is integrated into the host 
cell genome and rephcated together with the chromosome(s) into which it has been 
integrated. 

Preferably, a polynucleotide of the invention in a vector is operably linked to a 
regulatory sequence which is capable of providing for the expression of the coding 
sequence by the host cell, i.e. the vector is an expression vector. The term "operably 



wo 03/055906 




PCT/EP02/14749 



38 



linked" refers to a juxb^osition wherein the components described are in a 
relationship permitting them to function in their intended manner. A regulatory 
sequence such as a promoter, enhance or other expression regulation signal 
"operably linked" to a coding sequence is positioned in such a way that expression of 
the coding sequence is achieved under condition compatible with the control 
sequences. 

The vector may be a plasmid, cosmid, virus or phage vector, usually provided 
with aa origin of repUcation, optionally a promoter for the expression of the 
polynucleotide and optionally an enhancer and/or a regulator of the promoter. A 
terminator sequence may be present, as may be a polyadenylation sequence. The 
vector may contain one or more selectable n^rker genes, for example an ampicillin 
resistance gene (in the case of a bacterial plasmid) or a neomyciu resistance gene (for 
a mammaUan vector). Vectors may be used in vitro, for example for the production 
of RNA or used to transfect or transform a host cell. They may comprise two or 
more polynucleotides of the invention, for example for overexpression. 

The DNA sequence encoding the protein is preferably introduced into a suitable 
host as part of an expression cassette (or constract) in which the DNA sequence is 
operably linked to expression signals which are enable of directing expression of 
the DNA sequence in the host cells. For transformation of the suitable host with the 
expression construct transformation procedures are available which are well known 
to the skilled person. The expression construct can be used for transformation of the 
host as part of a vector carrying a selectable marker, or the expression construct may 
be co-transformed as a separate molecule together with the vector carrying a 
selectable marker. The vector may comprise one or more selectable marker genes. 

A vector or expression construct for a given host cell may comprise the 
following elements operably Unked to each other in a consecutive order from the 5'- 
end to 3'-end relative to the coding strand of the sequence encoding the polypeptide 
of die first aspect: 

(1) a promoter sequence capable of directing transcription of the DNA 
sequence encoding the polypeptide in the given host cell; 

(2) optionally, a signal sequence capable of directing secretion of the 
polypeptide from the given host cell into a culture medium; 
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(3) a DNA sequence encoding a mature and preferably active form of the 
polypeptide; 

(4) preferably, a transcription temiination region (terminator) capable of 
terminating transcription downstream of the DNA sequence encoding the 
polypeptide; and 

(5) optionally, a repressor (e.g. XylR). 

Downstream of the DNA sequence encoding the polypeptide there may be a 3' 
untranslated region containing one or more transcription termination sites (e.g. a 
terminator). The origin of the terminator is less critical. The terminator can e.g. be 
native to the DNA sequence encoding the polypeptide. The terminator may be 
endogenous to the host cell (in which the DNA sequence encoding the polypeptide is 
to be expressed). 

Enhanced expression of the polynucleotide encoding the protein of the invention 
may also be achieved by the selection of heterologous regulatory regions, e.g. 
promoter, secretion leader and/or terminator regions, which may serve to increase 
expression and, if desired, secretion levels of the protein of interest from the 
expression host and/or to provide for the inducible control of the expression of tiie 
polypeptide of the invention. Preferably at least one, and advantageously all, of 
these regulatory regions or elements originate from, or are compatible with, 
Lactobacillus bacteria. This may render the vectors and/or host cells food grade (or 
GRAS), except for any antigen encoding sequences. 

Aside from the promoter native to the gene encoding the protein of the 
invention, other promoters may be used to direct expression of the polypeptide of the 
invention. The promoter may be selected for its efficiency in directing the 
e3q)ression of tiie polypeptide of the invention in the desired expression host. 

Promoters/enhancers and other expression regulation signals may be selected to 
be compatible with the host cell for which the expression vector is designed. For 
example prokaryotic promoters may be used, in particular those suitable for use in 
Lactobacillus or E.coli strains. 

Examples of bacterial promoters are the xylose, lactate dehydrogenase, 
a-amylase and SPo2 promoters as well as promoters from extracellular protease 
genes. 
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Host cells and Expression 

In a ftirther aspect the invention provides a process for preparing a polypeptide 
according to the invention which comprises cultivating a host cell (e.g. transformed 
or transfected with an expression vector as described above) under conditions to 
provide for expression (by the vector) of a coding sequence encoding the protein, and 
optionally recovering the expressed protein. Polynucleotides of the invention can be 
incoiporated into a recombinant replicable vector, e.g. an expression vector. The 
vector may be used to replicate the nucleic acid in a compatible host cell. Thus in a 
further embodiment, the invention provides a method of making a polynucleotide of 
the invention by introducing a polynucleotide of the invention into a replicable 
vector, introducing the vector into a compatible host cell, and growing the host cell 
under conditions which bring about the replication of the vector. The vector may be 
recovered from the host cell. Suitable host cells include bacteria such as 
Lactobacillus or lactic acid bacteria, E. cofi, yeast (e.g. Kluyveromyces), mammalian 
cell lines and other eukaryotic cell lines, for example insect cells such as Sf9 cells 
and (e.g. filamentous) fungal cells. 

Preferably the polypeptide is produced as a secreted protein in which case the 
DNA sequence encoding a mature form of the polypeptide in the expression 
construct is operably linked to a DNA sequence encoding a signal sequence. 
Preferably the signal sequence is native Oiomologous) to the DNA sequence 
encoding the protein. Alternatively the signal sequence is foreign (heterologous) to 
the DNA sequence encoding the protein, in which case the signal sequence is 
preferably endogenous to the host cell in which the DNA sequence is e^ressed. 

The vectors may be transconjugated, transformed or transfected into a suitable 
host cell as described above to provide for expression. This process may comprise 
culturing a host cell transformed with an e7q>fession vector as described above under 
conditions to provide for expression by the vector of a coding sequence encoding the 
protein. 

A further aspect of the invmtion thus provides host cells transconjugated, 
transformed or transfected with or comprising a polynucleotide or vector of the 
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invention. Preferably the polynucleotide is carried in a vector for the replication and 
e3q)ression of the polynucleotide. The cells will be chosen to be compatible with the 
said vector and may for example be prokaiyotic (for example bacterial), fungal, yeast 
or plant cells. 

The invention encompasses processes for the production of the protein of the 
invention by means of recombinant expression of a DNA sequence encoding the 
protein . For this purpose the DNA sequence of the invention can be used for gene 
amplification and/or exchange of expression signals, such as promoters, secretion 
signal sequences, in order to allow economic production of the protein in a suitable 
homologous or heterologous host cell. A homologous host cell is herein defined as a 
host cell which is of the same species or which is a variant within the same species as 
the species fix)m which the DNA sequence is derived. 

Preferred features and characteristics of one aspect of die invention are 
equally applicable to another aspect mutatis mutandis. 

The invention will now be illustrated by means of the following Examples, 
which refer to the figures, which are provided merely for the purposes of illustration, 
and the invention is not to be construed as being Umited to the subsequent Examples. 

EXAMPLES 
Summary 

The structure of the crystallisation domain, SAN, of the SA-protein of Zr. 
acidophilus ATCC 4356 was analysed by insertion and deletion mutagenesis, and by 
proteolytic treatment Mutant SA-protein synthesised in E. coli with 7-13 amino acid 
insertions near tiie N-terminus or within regions of sequence variation in SAN 
(amino acid position 7, 45, 1 14, 125, 193) could form crystalline sheets, whereas 
insertions in conserved regions or in regions with predicted secondary structure 
elements (positions 30, 66, 88 and 156) destroyed tiiis cs^acity. An insertion in the 
cell wall binding domain (position 345) did not affect crystallisation- FACscan 
analysis of Z. acidophilus synthesising three crystallising and one non-crystaUising 
SA"protein c-myc (19 amino acids) insertion mutant was performed with c-myc 
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antibodies. Fluorescence was most pronounced for insertions at positions 125 and 
156, less for position 45 and severely reduced for position 7. Ltmnunofluorescence 
microscopy revealed a fluorescent ring in a fraction of the bacterial popxilation, 
suggesting that these bacteria synthesised mutant SA-protein only. The finding that 
the chromosomal slpK gene was replaced by the mutant allele in a subset of the 
population corroborates this conclusion. Proteolytic treatment of SA-protein indicated 
that only sites near the middle of SAN are susceptible, although potential cleavage 
sites are present through liie entire molecule. Expression in E, coli of DNA 
sequences encoding the two halves of SAN yielded peptides that could dimerise. The 
results indicate that SAN consists of a -12 kDa N- and a -1 8 kDa C-terminal 
subdomain linked by a surface exposed loop. The capacity of SA-protein of L. 
acidophilus to present epitopes, up to ~ 19 amino acids in length, at the bacterial 
surface and the exchange of the wild type by a mutant allele in ^nvo, makes the 
system suitable for application as an oral delivery vehicle. 

Introduction 

Surface layers or S-layers have been foimd m up to 400 different species of 
Eu- and Archaebacteria. They consist of one species of (glyco-)protein, the S- 
protein, which assembles into characteristic two-dimensional crystalline layers at the 
cell surface. This assembly is an entropy-driven process during which individual S- 
protein monomers form multiple interactions with each other and with the under- 
lying cell envelope (Beveridge, 1994; Sleytr & Messner, 1983). Why certain bacteria 
possess an S-layer is not always known but fliey have been shown to function as 
molecular sieve, scaffold for extracellular enzymes, protective coat or virulence 
factor (Egelseer et al, 1995; Noonan & Trust, 1997; S^ra & Sleytr, 1987). 

S-protein structure-function relationships remain poorly understood due to 
the lack of suitable methods to detOTnine S-protein structure at the atomic level. 
Certainly the best understood aspect of S-protein stracture is the association of S- 
proteins with the cell wall. Several cell wall associating domains have be«i described 
for S-proteins from Bacillus, Corynebacterium, Lactobacillus and Thermobacterium. 
Recent studies have shown that many S-proteins contain one or more copies of the 
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so-called Surface Layer Homology or SLH domain for cell surface attachment 
(Egelseer et aL, 1998; Lemaire et al, 1998; Mesnage et al, 1999; Olabania et aL, 
1996). In contrast, much less is known about the role of structural components or 
domains in S-protein crystal formation but a few studies have shown that distinct 
crystallisation domains exist in some S-proteins, possibly corresponding to the 
morphological domains observed in EM studies (Baumeister et al.y 1989; Engelhardt 
& Peters, 1998; Jarosch et al, 2001; Mesnage et aL, 2000; Smit et al, 2001). 

Several species of the genus Lactobacillus possess an S-layer. The S-layers of 
L. acidophilus and related species are composed of a single S-protein (SA-protein ) of 
around 45 kDa (Boot et al, 1996). The SA-protein of L. acidophilus ATCC 4356 and 
CbsA of X. crispatus JCM 5810 are the best studied with respect to structure and 
function (Boot et al, 1993; Sillanpaa et al, 2000; Smit et al, 2001). The function of 
the S-layer of these organisms is unknown but lactobacillar S-layers may be 
important for bacterial adhesion to intestinal epithelial cells and extracellular matrix 
(ECM) components (Schneitz et al, 1993; Toba et a/., 1995). 

SA-protein of Z. acidophilic shows considerable similarity to the putative 
product otslpB encoded by a silent S-protein gene of X. acidophilus, to S-proteins 
from L. helveticus and L. crispatus, and to a haem-agglutinating protein (HAP50) 
from L. acidophilus, but not to other S-proteins (Boot et aL, 1996; Boot et aL, 1995). 
Analysis of the primary amino acid sequences of these proteins showed that 
homology between these proteins is highest in the C-terminal one third of the 
proteins (77-99% identity) and lower in the N-terminal two third (30-72% identity). 
While the N-terminal region shows a preponderance of hydrophobic amino acids, the 
C-terminal region is mainly composed of hydrophilic residues, a large fraction of 
which consists of basic amino acids, rendering this region highly positively charged. 

It has been demonstrated tiiat the N-terminal and C-tenninal parts of SA- 
protein constitute different structural and functional domains. The N-tenninal part of 
SA-protein (amino acids 1-290) constitutes the crystallisation domain, SAN, and is 
able to form S-layer crystals with lattice parameters similar to those of crystals 
formed by intact SA-protein. The C-temiinal part of SA-protein (amino acids 291- 
412), SAC, serves to attach the S-layer to the cell wall (Smit et al, 2001). 
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SA-protein is among the smallest S-proteins known, making SA-protein an 
ide^ candidate for the study of structure-function relationship of S-proteins. 
Insertion mutagenesis is a method widely used to determine structure-function 
relationships of proteins (Single et al, 1997; Norton et al, 1998; van Geest & 
Lolkema, 2000; Wong & Hancock, 2000). In the present invention insertion 
mutagenesis and proteolytic treatment was employed to gain further insight into the 
stractural organisation of the Z. acidophilus S-layer protein and its crystallisation 
domain SAN in particular. The results suggest a structural model for S A-protein in 
which SAN is comprised of two subdomains of around 15 kDa each flanking a loop 
region that is exposed to the environment. DNA sequences CTicoding the two 
hypothetical subdomains were expressed in E. coli and the properties of the resulting 
polypeptides were determined. 

This shows that functional SA-protein mutants can be used for the efficient 
surface exposure of a selected epitope making the system suitable for application as 
an oral vaccination vehicle. 

Results 

Insertion of peptide linkers in the SA-protein ofL. acidophilus 

The linker insertion mutagenesis strategy yielded ten different slpA insertion 
mutants slpAU, slpAU, slpAlS, slpAU, slpAlS, slpA\6, slpAl^y slpAl9, slpAlO and 
slpAll^ possessing linker insertions (NcNoX; 7 or 8 amino acids) after amino acid 
45, 125, 156, 177, 30, 66, 88, 1 14, 193 and 349, respectively, of the mature SA- 
protein. One additional mutant, slpA9o^ contained a c-myc epitope insertion at amino 
acid 7 instead of the NcNoX linker insertion. The mutant genes and unmodified slpA 
{slpAlO\ were transferred to expression vector pQE30AXN and the corresponding 
proteins Sha9c, Shah, Shai2, Shai3, Shaw, Shais, Shai6, Shais, Shai9, Sha20, Sha2i anc 
Shaio were synthesised in E. coli and purified by metal-affinity chromatography 
(Table 1). 



Table L 
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%itant 


Site/type of 
modification 


Position 
(AAnoy 


AA composition^ 


Size 
(AA) 


Crystal 
formation 


10 


N-termmal histag ^ 


0 


-RGSHeGSGT - ATTTN 


13 


\ \ \ 


11 


5j/EII/linker-peptide 


45 


AIAGN - A M A A A R G N- LTGTI 


8 


1 1 1 


12 


^g^el/linker-peptide 


125 


VKFTG - AMAAARAG - TNSDN 


8 


■I 1 1 


13 


iWttrtl/linker-peptide 


156 


TNVSI - TMAAARGI - ANVYA 


8 




14 


^Jjel/linker-peptide^ 


177 


YDVTS - AM A A AKA o — UAIVI 


o 
o 




15 


C/al/linker-peptide 


349 


NAANI - VAMAAARG - DGTKR 


8 


1 1 \ 


16 


P5<I/linker-peptide 


30 


PSVSA-SMAAARGA-VAANT 


8 




18 


>4fwNIAinker-peptide 


67 


LKADT- M AAARNT-ENATI 


7 




19 


Pstl /linkerpeptide 


88 


ABLAA - SMAAARG A- GVAYT 


8 




20 


5(vl/linker-^>eptide 


114 


KTVTL- AMAAARGL-GSANS 


8 


+++ 


21 


5(yl/linker-peptide 


193 


NADNQ - AMAAARGQ - VNVAN 


8 


+ 


9c 


c-myc 


7 


TTINA-EQKLISEEDLNLE-SSSAI 


13 


+++ 


11c 


c-myc 


45 


-AMVNEQKLISEEDLNARGN- 


19 


+-H- 


12c 


c-myc 


125 


-AMVNEQKLISEEDLNARAG- 


19 


-H-f 


13c 


c-myc 


156 


-TMVNEQKLISEEDLNARGI- 


19 





1 : Amino acid (AA) positions relative to the start of the mature SA-protein 
2: Flanking sequences are underlined: these are SEQ ID NOS. 3 to 17 
3: This His-tag is present at the N-terminus of all mutants listed 

4: Insertion of 23 bp instead of 24 bp resulted in a firameshift mutation and truncation of tiie protein 

All mutant genes were expressed at high levels in E. coli. SDS-PAGE 
analysis revealed that the gene products had the expected molecular mass (Mr) of 
about 46 kDa, slightly larger than wild type SA-protein (results not shown). For 
mutant Shai4 a truncated peptide of about 25 kDa was observed, the result of a 
frame-shift mutation abready observed during sequencing. Mutant Sha9c showed a Mr 
of around 47 kDa due to a larger insert, the c-myc epitope. 



Functional analysis of SA-protein mutants synthesised in E. coli 

Centrifugation of dialysed mutant SA-proteins allowed discrimination 
between aystaUised and soluble SA-protein. Mutant proteins Sha9c, Shaiu Shai29 
Shais, Shai6 and Sha20 and the wild type protein Shaio formed a precipitate, 
indicative of SA-protein crystallisation (Smit et al, 2001), mutant Sha2i formed less 
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precipitate than the previous mutants, while mutants Shai3, Shai8> Shai9 and the 
truncated Sham did not show any precipitate formation. SDS-PAGE analysis of the 
soluble and insoluble fractions confirmed that a substantial part of S-protein was 
present in the pellet fractions of Sha9c> Shaio? Shaiu Shai2, Shai5> Shai6, and Sha20- 
For all other mutants, SA-protein was found only in the supernatant fraction (Figure 
2). Electron microscopical analysis of the precipitates formed by wild type (His- 
tagged) SA-protein (Shaio) and the precipitating mutants showed crystalUne sheets 
possessing obUque symmetry, similar to that of SA-protein (data not shown). 

Three insertion mutants, slpAl 1, slpAll, and slpA13, were selected for 
replacement of the NcNoX linker by a c-myc epitope linker. This yielded mutant 
proteins Shauc Shai2c and Shai3c with an insert of 19 amino acids and an Mr of 
aroimd 47 kDa. Introduction of the larger insert did not alter the properties of the 
mutant proteins. Shaiic and Shai2c could still form S-layer crystals whereas mutant 
Shai3c was unable to form crystals as determined by EM (data not shown). 

Construction of expression vectors for Lactobacillus and synthesis of SA-protein with 
a c-myc insertion in L, casei andL. acidophilus 

To determine whether the c-myc epitope in mutants Sha9c, Shaiic? Shai2c and 
Shai3c is buried in the SA-protein interior or is exposed on the S-layer surface, mutant 
SA-proteins were co-expressed with wild type SA-protein in L. acidophilus. For the 
expression of mutant S-protein genes in Lactobacillus new vectors were constructed 
in which the slpA9c, slpAllOy slpAVlc and slpAl3c genes were combined with the 
original slpA expression signals (the 5' leader sequence for mKNA stabilisation, 
ribosome binding site (RBS), start codon and secretion signal) (Figure 1). The 
complete sip cassettes were then transferred to the E. coli-Lactobacillus shuttle 
vector pLP401-T and introduced into L, casei ATCC 393. 

Analysis of mutant sip gene expression in L. casei revealed that all 
transfonnants produced mutant SA-protein irrespective of whether the Idh terminator 
sequence between the promoter and mutant sip gene was present or not (results not 
shown). Mutant SA-protein production under inducing conditions from vectors 
without the terminator was only marginally increased compared to that from vectors 
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with the transcription tenninator. This constitutive production of mutant SA-protein 
in L. casei under non-inducing conditions from vectors with the Idh transcription 
terminator, suggests the presence of a previously undetected promoter sequence in 
the 5' region of the sip fragments. In ftirther ejqperiments we used this promoter 
activity for constitutive mutant SA-protein production in L. casei and X. acidophilus. 
After confirmation of the synthesis and secretion by L. casei of the mutant Sa- 
proteins, of their correct relative molecular mass and the presence of the c-myc 
epitope by SDS-P AGE and westem analysis (data not shown), CsCl purified vector 
DNA (without Idh tenninator) obtained from L. casei was introduced in L. 
acidophilus ATCC 4356. 

L. acidophilus transformants produced less mutant SA-protein compared to L. 
casei. From L, acidophilus transformants producing Sa9c, Saiic Sai2c and Sauc 
mutant protein as well as the wild type strain SA-protein could be completely 
removed by 5 M LiCl extraction, indicating that the bacteria contain a mosaic S-layer 
comprised of wild type and mutant protein. Mutant proteins were not detected in the 
culture mediimi. Using SDS-PAGE and westem analysis, an additional band, with a 
sUghtly lower mobility than the wild type SA-protein band, was observed in the 5 M 
LiCl extract of the four Z. acidophilus transformants. Westem analysis also showed 
that this band reacted with anti-c-myc antibody while wild type SA-protein did not, 
confirming the presence of the c-myc epitope (results not shown). It was also 
observed that mutants Sa9c and Saiic were produced at equal levels, while the Sai2c 
production level was lower and that of Saisc even more so (Figure 3). 

SDS-PAGE analysis of soluble and precipitated SA-protein fractions obtained 
by dialysis and centriftigation of the 5 M LiCl extracts revealed that all four SA- 
protein mutants associated with botih pellet and supematant fractions, a behaviour 
typically observed for fiilly fimctional wild type SA-protein. No preferential 
association was observed of the non-crystallising mutant Saisc with the dialysate 
supematant fraction although this was observed for the protein purified fix)m E. coli 
(data not shown). 

Exposition of the c-myc epitope at the cell surface ofL, acidophilus 

Surface presentation of the c-myc epitope was analysed by flow cytometry 
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and mununofluorescence microscopy. FACScan analysis with c-noiyc antibodies 
yielded a signal for each of the L. acidophilus transfonnants although important 
differences were observed. The detected fluorescent signal increased in the following 
order: Sa9cj Saiic, Saisc Sai2c (Figure 4). Interestingly, the cytometry signals were not 
proportional to the mutant protein production levels, which were lowest for the 
mutants with the highest fluorescent signals (Sai2c and Sai3c)- 

The flow cytometry experiments also showed, especially for mutants Sai2c 
and Sai3c> a large heterogeneity in fluorescence intensity. Using immunofluorescence 
microscopy two types of fluorescent cells were observed, one with diffuse 
fluorescent spots randomly distributed on the cells and another with an intense 
fluorescent ring completely surrounding the cell (data not shown). 

Detection of recombination at the chromosomal slpA locus 

The S-layer of cells possessing a surrounding fluorescCTit ring possibly 
consists solely of mutant SA-protein since a sinoilar int^se ring was observed using 
wild type cells and anti-SA-protein serum (Boot, 1996). To detennine whether the 
chromosomal copy of the sip A gene in these cells has been inactivated or replaced by 
the mutant one carried by the expression vector a PGR strategy was applied. Using 
genomic DNA preparations isolated from L, acidophilus transformants harbouring 
pLP5//7A9c, pLPjjJpAl Ic, pLP5/pA12c, and pLP^/pAlSc we were able to ampUfy 
recombination-specific fragments for all four transformants, indicating that the 
chromosomal slpA gene had been replaced by the mutant allele. In a control 
experiment using wild type L. acidophilus genomic DNA mixed with purified 
pLPji[pA12c as template we did not find any PGR product (data not shown). 

Proteolytic treatment of SA-protein 

Limited digestion of SA-protein with various proteolytic enzymes yielded one 
major product of 36 kDa representing SAN, and several smaller fragments. The 
N-terminal sequence of these firagments has been determined: the fragments obtained 
after trypsin digestion had the following N-termini ATTIN and VKLDQ (amino 
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acids 1-5 and 140-144), and those obtained after chymotrypsin digestion AriiN and 
AINTT (1-5 and 160-164), showing that these peptides represented the N-terminal 
and C-terminal regions of SAN. Apparently only sites near the middle of SAN are 
susceptible to proteolytic cleavage although a number of other potential trypsin and 
chymotrypsin cleavage sites are present in the two peptides. 

Properties of truncated SAN fragments 

To get further insight in the structure of SAN, vectors were constructed 
encoding C-terminally (SAN2, 4 and 6 carrying the first 159, 149 and 113 amino 
acids) or N-terminally truncated peptides (SAN7, 3 and 5 comprising the last 177, 
151 and 141 amino acids). The truncated Bfis-tagged genes were e3q)ressed iaE. coli 
and the peptides S AN2, S AN4, SAN6 and S AN7 were purified by affinity 
chromatography. The peptides SAN3 and SANS were produced at very low levels 
and could not be purified probably due to increased sensitivity to E. coli proteases. 
SDS-PAGE and western analysis indicated that the truncated protems had the 
expected size. Western blottmg also showed the presence of molecules Avith twice the 
size of the monomers for the peptides, SAN2, SAN4, SAN6 and SAN7 (Figure 5). 
The stability of the peptides against proteolytic attack and their capacity to dimerise 
suggests that they constitute functionally active imits. 

Discussion 

Traditionally, S-layers have been studied by electron microscopical 
techniques and as a result much is known about their ultrastructure. Little is known, 
however, about S-proteift structure-function relationships. In recent years, with the 
advent of versatile recombinant DNA technologies, valuable new clues to the 
structural organisation of S-proteins have been obtained. Methods like deletion 
analysis, sub-cloning of domains, linker mutagenesis and "cysteine-scanning'* 
mutagenesis have been used for this purpose (Jarosch et aL^ 2001; Mesnage et aL, 
1999; Smites al, 2001). 
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In the invention linker insertion aad deletion mutagenesis were applied as 
well as S-protein proteolysis to clarify the structural organisation of the 
crystallisation domain, SAN, of the L. acidophilus SA-protein. The analysis 
comprised insertion of seven to thirteen amino-acid residues at eleven positions 
randomly distributed throughout the SA-protein, ten of which were located in SAN. 
All mutant proteins contained an additional N-terminal His-tag (13 amino acids), 
which did not interfere with SA-protein crystallisation as was previously confirmed 
by electron microscopy. 

Introduction of insertions at positions 30, 66, 88, and 156 of SA-protein 
(Shai3, Shai6, Shai8 a^d Shai9) resulted in complete abolishment of the capacity to 
form crystals in vitro. Mutants with insertions after amino acids 7, 45, 125, 114 
(Sha9c, Shah, Shai2, and Sha2o) formed crystals similar to those formed by the His- 
tagged wild type protein, while an insertion at position 193 (Sha2i) resulted in a 
partially functional mutant SA-ptotem. An insertion in the cell wall binding domain 
(Shais) had no effect on assembly, as expected. 

For the interpretation of these results multiple S-protein sequence alignanents 
and predicted S-protein sequence motifs (SillanpSa et al, 2000; Smit et a/., 2001) 
were used. The data obtained from these analyses showed that the SAN domain of 
SA-protein consists of regions with high variability in composition and length (gap 
regions) altemating with regions that show considerably less variation. It was 
proposed that the regions of higher conservation are important for domain structure- 
function (responsible either for intra- or intermolecular interactions) (Sillanpaa et aL, 
2000; Smit et aL^ 2001). Dismption of these regions may result in loss of domain 
function, while regions of variable length are more likely to represent protein surfiace 
regions or loops (Miyazawa & Jemigan, 2000) and may accept the insertion of 
additional amino acid residues. 

In general, insertions in regions tiiat were well aligned in SA-protein 
aUgimients or contained predicted secondary structure elements were not allowed. 
This in contrast to insertions in variable, non-conserved regions without any 
predicted secondary structure elements, which were accepted without dismpting 
S-layer formation. Two mutated regions are of importance. First the N-terminal 
region of the SA-protein. The N-terminus itself is flexible and does not directly play a 
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role in SA-protein crystallisation since it accepts an extension of 13 amino acids 
without loss of domain function (Smit et al.^ 2001). Insertion of additional amino 
acids at position 7 (Sha9c) was probably accepted because of the proximity to the Sa- 
protein N-terminus. If the insertion was made further away from the N-tenninus 
(amino acid 30; Shai6) then protein function was compromised, possibly due to 
improper folding of the region or to interruption of protein-protein interactions 
between SA-protein monomers. Despite the lack of significant conservation between 
amino acids 30 and 47, this region seems to be important for SA-ptotein 
crystallisation. 

The second region in which mutations 12 and 20 were made (positions 125 
and 1 14, respectively), is interesting because of its location in the centre of the SAN 
domain where insertions of up to 19 amino acids did not affect SA-protein 
crystallisation. This central region is the largest variable region observed in 
aligmnents and is the only region in SAN containing sites that are accessible to 
proteolytic enzymes. 

Finally, flow cytometric and immunofluorescence analysis of mutant Sa- 
protein-produciag L, acidophilus showed that the regions near amino acids 125 and 
156 (Sai2c and Saisc) were strongly reacting with anti-c-myc antibodies, whereas 
those near amino acids 9 and 45 (Sa9c or Saiic) were not. From these results we 
conclude that the former regions are faciug the environment, while the N-termined 
region is poorly accessible to antibodies and thus is either buried within SAN or is 
facing the S-layer pore or cell wall. 

Interestingly, mutant Sauc , although non-functional in in vitro analyses, 
forms an integral part of the S-layer in vivo^ since the mutant protein could be 
extracted from L. acidophilus transformants by 5M LiCl. Moreover, a strong positive 
signal was observed in the FACS analysis implying that c-myc in Sai3c is located at 
the bacterial surface. Since the S-layer of the transformants is comprised of wild type 
and mutant SA-protein, we conclude that Saisc can functionally interact with wild 
type SA-proteiiL 

The differrace in distribution of the flow cytometry signals, most clearly 
detectable for Z. acidophilus transfomiants producing Sai2c or Saisc, suggested a 
difference in mutant protein production levels within the population of bacteria. 
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ImmunofluorescCTce microscopy and PGR experim^ts showed that in a fraction of 
the bacterial population the wild type SA-protein was fiilly replaced by the mutant 
proteiQ as a result of the exchange of the chromosomal slpA gene by the plasmid 
encoded mutant allele. Since this phenomenon was also observed for Saisc we 
assume that Sai3c can form a genuine S-layer in vivo. An explanation why Sauc 
forms an S-layer in vivo but not in vitro might be that iateraction of SA-protein with 
the cell wall stabiUses the interaction of the Saisc monomers, or induces a 
conformational change of the monomers facilitating their assembly. Also the absence 
of the His-tag in Saisc tnay positively affect its crystallisation. 

Based on the proteolysis experiments it was assumed that the central region 
divides the SAN domain into two parts that may constitute subdomains. To verify 
this assumption the properties of N- and C-tenninally truncated SAN peptides were 
determined. Truncated forms of SAN could be produced in E, coli with a size of 12- 
19 kDa that were insensitive to proteases present in E. coli, when up to 170 C- 
terminal residues or up to 120 N-terminal residues were deleted. Both C-terminally 
and N-terminally truncated peptides were capable of forming dimers in vitro that 
could resist boiling in the presence of 1% SDS. Apparently these fragments represent 
functionally active imits, as they can dimerise through protein-protein interaction 
under strongly denaturing conditions. Since all three C-terminally truncated peptides 
were proteolytically stable and could dimerise whereas only the largest N-terminally 
truncated form (S AN7) showed this behaviour, one can conclude that the subdomain 
boundaries are located between amino acids 1 14 and 140. 

The two subdomains described above could represent the two morphological 
subdomains observed in our electron crystallographic study of SA-protein and SAN. 
For each SAN monomer four contact points with neighbouring monomers could be 
observed, three in the elongated and one in the spherical subdomain (Smit et aL^ 
2001). Figure 6 shows a model which schematically depicts how crystals of SAN 
could be formed by dimerisation and oligomerisation of its subdomains. 

Finally, the experiments demonstrate how to surface-expose epitopes on the 
S-layer of i. acidophilus. Based on this finding the presentation of other antigens on 
the surface of lactobacilli using the S-layer can be used in oral or nasal vaccines. 
Lactobacilli possess several additional properties that make them highly suitable for 
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such applications, i.e. their GRAS (generally regarded as safe) status, their 
immunomodulating properties and their capacity to evoke mucosal and systemic 
immune responses against associated antigens (Maassen et aL^ 1999; Shaw et aL^ 

2000) . The observation that mutant S-layer genes can replace the chromosomal wild 
type gene by genetic recombination, rendering the mutation genetically stable, 
further adds to the application potential of genetically engineered Lactobacillus S- 
proteins. 

Materials and Methods 

Bacterial strains and growth conditions 

L, acidophilus ATCC 4356 andX. casei ATCC 393 were cultivated 
anaerobically in MRS broth (Difco) at 3TC. Lactococcus lactis MG1614 was 
cultivated aerobically in GM17 medium at 30®C. Escherichia coli Ml 5 (pREP4) 
(Qiagen) and E. coli DH5a (Phabagen, The Netherlands) were cultivated aerobically 
in Luria (L) broth at 37°C. When necessary, media were supplemented with 1,5% 
agarose, 5 p,g/ml (L. lactis) or 7.5 |ig/ml (Z. casei andi. acidophilus) 
chloramphenicol, 100 jig/ml ampicillin and/or 25 p.g/ml kanamycin {E. coli). 

Linker insertion mutagenesis of the Z. acidophilus slpA gene 

A pET5a (Promega) derived vector, pTslpAlO (previously named pTAlO), 
containing a BamHl-Hindni PGR fragment encoding mature SA-protein (Smit et al., 

2001) was used for insertion mutagenesis. Double stranded DNA linkers were 
inserted in unique restriction sites or in sites occurring twice in the slpA gene (Figure 
la). Linkers consisted of Ncol, Notl znAXhol (NcNoX) sites flanked by restriction 
site-specific S' and 3' sticky ends. Ligation mixtures were introduced in J?, coli 
DH5a and clones harbouring ^TslpAlO vector with the desired linker insert, were 
identified by colony PGR and analysed by DNA sequencing. The resulting plasmids 
are indicated with pT^i^A followed by the mutant number as indicated in Table 1 
(PT5//7AI 1, pTj//7A12 etc). 
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Functional analysis of SA-protein insertion mutants in E. coli 

Mutant SA-protein genes were cloned in pQESOAXN as BamYfl-HinSni. 
fragments introducing an N-terminal six histidine tag (plasmids are indicated with 
^lAslpA followed by mutant nimiber). Mutant SA-protein gene expression and metal 
affinity purification were carried out as previously described (Smit et al., 2001). 
Purified mutant SA-protein (500 iiig/ml) was dialysed exhaustively against 50 mM 
Tris-HCl, pH 7.5 and checked for precipitation, which is indicative of SA-protein 
crystal formation (Smit et al., 2001). Soluble and precipitated S-protein firactions 
were separated by centrifugation and analysed by SDS-PAGE. Precipitated material 
was also analysed by electron microscopy (EM) (Smit et al^ 2001). In three mutant 
plasmids pHs/;7Al 1, pHy//>A12 and pHs/pA13 the NcNoX linker was removed by 
Ncol dsxAXhol digestion and replaced by a c-myc epitope linker (Ncol-c-myc-XhoI). 
The mutant proteins were produced and analysed as described for the other mutants. 
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Construction of plasmid pHslpA9c 

An additional insertion mutant was constructed by PGR using vector pBKl 
CBoot et a/., 1993) as template and primers 5'SLPAl (5' 

GCGCGAATTCAGATCTATCGTGGTAAGTAATAGGACGTG 3': SEQ ID NO. 
18) and CMYCKE 

(5 'CAGCGAATTCCTCGAGGTTTAAATCTTCTTCTGAA^ 
TGCGTTAATAGTAGTAGCAGCGC 3': SEQ ID NO. 19) yielding 5'slpA9c. This 
PGR product was introduced in vector pT5/j?A16-3 (containing Sail, BamBl and 
Xhol sites, after amino acid 7) as BglR-Xhol fragment yielding pT5'slpA9o. For 
purification and functional analysis of the mutant SA-protein in E. coli the sip region 
of pT5'5/jc?A9c was amplified using primers CEAMYCl 

(5'GGGGGGATGGGGTAGGGGTAGTAGTATTAAGGCAGAAG3': SEQ ID 
NO. 20)andGEA2(5' 

GGCGGGATGCAAGGTTATGGAAGTATGAGAAGATGGTATT 3': SEQ ID NO. 

21) and the BamHl-BsiETL fragment was transferred to pHs-ZjpAlO yielding pIIs/^A9c. 
Functional analysis of the purified mutant protein was pCTformed as described above. 

Re-introduction of 5 ' slpA expression signals in mutants slpAllc, slpA12c and 
slpAlSc 

A firagment including the 5' expression signals of the slpA gone (nt -190 to + 
150 relative to the ATG), flanked by BgUI (5') and Hindm (3') sites, was amplified 
fix)m plasmid pBKl (Boot et al, 1993) by PGR usmg primers 5*SLPA1 (5* 
GGGGGAATTCAGATGTATGGTGGTAAGtAATAGGAGGTG 3': SEQ ID NO. 

22) and 5'SLPA2 (5' GGGGAAGCTTGAGTAGTGCTACGAGCAGGAG 3': SEQ 
ID NO. 23). The PGR product was inserted in pGEM-T and after sequence 
confirmation excised with 5coRI and /ffndin and inserted in pUC19 to give ipS^slpA. 
Mutant SA-protein cassettes isolated from ipTslpAl Ic, ^TslpAllo, and i^TslpA\3c as 
BstEH or Pstl'Hin^ini fragments were cloned in -pS^'slpA to yield ip5'slpA\ Ic, 
i^S'slpAllo and ip5'slpA\3o. Glones containing the complete sip cassettes were 
identified by colony PGR and restriction analysis. 
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Construction of repression vectors for mutant SA-protein genes in L. casei and L. 
acidophilus 

To achieve the synthesis of mutant SA-proteins Sa9cj Saiicj Sai2c and Sai3c in 
L. casei and L. acidophilus^ the multi-host range vector pLP401-T was used. This 
vector contains the inducible promoter of the Lactobacillus amylovorus a-amylase 
gene (Pouwels et aL, 2001). The complete sip cassettes were isolated from 
pT5'slpA9c, pS^slpAllc, p5'slpAl2c and pS^slpAlSo as BgHL-HindUI fragments and 
cloned in BamBUHindlll digested pLP401-T to yield vectors pLP^-^p^Pc, 
pLPslpAl Ic, pLP^^^12c and pLPslpAlSc. Ligation mixtures were introduced in L, 
lactis MG1614 and correct clones were selected by colony PGR. Highly purified 
plasmid DNA isolates were prepared by CsCl gradient centrifugation, digested with 
Notl to remove the Idh terminator, ligated and introduced inX. casei ATCC 393. 
Colony PGR was used to confirm ttie absence of the terminator sequence. 

Expression analysis of mutant SA-protein genes in L. casei andL. acidophilus 

L. casei and L. acidophilus transformants harbouring pLPslpA9c, 
pLPslpAl Ic, pLPslpA12c and pLPslpAlBc and untransformed L. casei ATCG 393 
and L. acidophilus ATCC 4356 were inoculated 1:100 from ON cultures into 50 ml 
MRS medium and incubated for 16 h at 37°C. Cells were collected by centrifugation 
(25 min, 3,000 x g, 4°G) and proteins in the culture supernatant were precipitated 
with TGA. The cell pellet was washed once with physiological salt and subjected to 1 
M and 5 M LiGl extraction to remove surface associated proteins as described 
previously (Smit et al., 2001). The 1 M and 5 M LiGl extracts were dialysed against 
ADat4°G. 

Exposition of c-myc at the S-layer surface ofL. acidophilus transformants 

Exposition of the c-myc epitope on the S-layer surface by L. acidophilus was 
determined by flow cytometry and immimofluorescence microscopy. Transformants 
containing pLPslpA90y pLPslpAl Ic, pLPslpAllc and pUPslpAlSc and wildtype L, 
acidophilus cells w^e harvested at the ead of the log phase and washed once with 
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phosphate buffered saline (PBS) siqaplemaited with 1 % (w/v) fetal calf serum 
(FCS). After centriftigation the cell pellet was resuspended in PBS-FCS 
supplemented with 1 % FCS containing anti-SA-protein or anti-o-myc (1:5000 and 
1:50 diluted, respectively) antibodies. After incubation with FITC-labelled goat-anti- 
mouse antibodies, cells were analysed with a FACSCalibur flow cytometer (Becton 
DickinsorL, San Jose, CA) as described (Pouwels et aL, 2001), or inspected with an 
immunofluorescence microscope. 

PCR detection of chromosomal sip recombinants 

Genomic DNA was isolated from transforaiants containing expression 
vectors 'pLPslpA9o, ^'LVslpAWc, ipLVslpA\2c and pLP^/j^^lSc. PCR was performed 
according to standard procedures using primer A9 (forward, sip promoter-specific: 
5'-CTTGCTATTTCTTGAAGAG-3': SEQ ID NO. 24) and primer CMYCINT 
(reverse, c-m>;c -insert-specific: 5'-GCGTTTAAATCTTCTTCTGAA-3': SEQ ID 
NO. 25) and 15 ng of chromosomal DNA as template. To avoid misinterpretation of 
the results due to a Polymerase Halt-mediated Linkage Of Primers or PHLOP event, 
PCR was also perfomied using purified chromosomal DNA obtained from wild type 
L. acidophilus which was mixed with purified ipUPslpAllo DNA at physiological 
relevant molar ratios (chrDNA : plDNA ratio's of 1:0.3, 1:3, 1:30 and 1:300). 

Construction, purification and fidnctional analysis of truncated SAN fragments 

Three DNA fragments encoding C-terminally truncated SAN peptides were 
ampUfied by PCR with pBKl (Boot et ai, 1993) as template. Primer CEAl (5'- 
GGGGGGATCCGGTACCGCTACTACTATTAACGCAAGTTC -3', forward: 
SEQ ID NO. 26) was used in combination with primer SANAl (5'- 
CCCCGAATTCAAGCTTTTATTAGT-ATACGTTTGCAATTGA>^ 
reverse: SEQ ID NO. 27) to ampUfy SAN2, SANA2 (5'- 

CCCCAAGCTTTTATTATGAAGCAACACCGTTTTGGTC-3% reverse: SEQ ID 
NO. 28) for SAN4 and SANA3 (5'- 

CCCCAAGCTTTTATTAACCAAGGGTAACAGTCTTACC-3% reverse: SEQ ID 
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NO. 29) for SAN6. Fragments encoding three N-terminally truncated peptides were 
amplified using primer SANBl (5'-GGGGGGATCCGGTACCAAAGTTAAGT- 
TAGACCAAAACGGTG-3': SEQ ID NO. 30) for SAN3, SANB2 (5'- 
GGGGGGATCCGGTACCC-TT-ACTAATGTTTCAATTGCAAACG-3': SEQ ID 
NO. 31) for SANS and SANB3 (5'-GGGGGG- 

ATCCGGTACCTCAGCTAACTCAAATGTAAAATT-3': SEQ ID NO. 32) for 
SAN7, all three in combination with primer SANl 

(5'CCCCGAATTCAAGCTTTTATTAAATTC-TCTTGCTTAGCTGGGCTAC-3', 
reverse: SEQ ID NO. 33) and the same template. The fi:agments were cloned in 
pGEM-T and sequenced. The fi:agments encoding SAN2, SAN3, SAN4, SANS, 
S AN6 and S AN7 were excised as BarnHL-HindUL firagments and introduced in vector 
pQE30AXN (Smit et al., 2001) in fiiame with an N-terminal six histidine tag. 
Expression of gene firagments and purification of the gene products performed ds 
described previously (Smit et aL, 2001). Functional analysis of the truncated SAN 
peptides consisted of dialysis of samples (500 |J.g/ml) against 50 mM Tris-HCl, pH 
7.5 followed by SDS-PAGE and westem analysis. 

DNA manipulation, protein analysis 

DNA manipulations and protein analysis were performed followiag standard 
procedures (Laemmli, 1970; Sambrook et aL^ 1989). Plasmid DNA was isolated 
from Lactococcus/ Lactobacillus according to published procedures (Kok et aL^ 1984; 
Posno et aL, 1991). Genomic DNA was isolated from lactobacilli using 
cetyltrimethylaromonium bromide (CTAB) (Towner, 1991). Transformation of i. 
lactis, Z. casei and L, acidophilus was performed following published procedures 
(Kok et aL, 1984; Posno et aL, 1991; Walker et aL, 1996). Proteolytic treatment of 
SA-protein was performed as previously described (Smit et aL, 2001). 

Vaccine preparation and immunisation 

The capacity of recombinant Z. acidophilus S-layer proteins to evoke an 
immune response against the inserted c-myc epitope was assessed for mutant 
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S-protein Sai2c that exposes c-myc to the environment and for mutant S-protein Sa9c 
in which c-myc is buried within the S-layer, Female C57B1/6 and Balb/c mice were 
immunized three times a week with a 3-week interval between immunizations. 
Purified mutant S-protein (range 25-lOOg per dose) in phosphate buffered saline 
(pH7) or in Freund's incomplete adjuvant (Difco Laboratores) was adminitered 
intraperitoneally. Serum samples were collected 14 and 28 days past prime 
immunisation, and 7 and 14 days past booster immunisation. The induction of 
antibodies was determined in an ELISA using purified Sha9c Shai2c, and Shaio 
protein as probes, as well as a purified cmyc carrying recombinant protein (CD81- 
cmyc) as a positive control. The results indicate the recombinant L.acidophilus S- 
layer proteins with inserted antigens can evoke an immxme response to both the 
protein itself (Table 2, positive response of all Sha proteins against SAIO) as well as 
to the inserted antigen (Table 2, positive response against CDS 1 -cmyc). 



Table 2 
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The proteins (50|xl) were coated o/n onto plastic wells at 4'*C at a 
concentration of 0-lOp.g/ml and after pretreatment with skimmed milk incubated with 
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anti-cmyc antiserum (Phariningen). The recombinant protein CD81-cmyc served as 
a positive control 

In a second experiment the capacity of acidophilus expressing Sai2c or 
Sha9c to induce antibodies against the c-myc epitope can be determined. lOOg of 
dead recombinant L. acidophilus in phosphate buffered saline (pH 7) or in Freund's 
incomplete adjuvant (Difco Laboraties) is to be administered intraperitoneally in a 
2501 voliraie twice with a 4-week interval between immimisations. Serum samples 
are then collected 14 and 28 days past prime immunisation, and 7 and 14 days past 
booster immxmisation. The induction of antibodies is determined in an ELIS A using 
purified Sha9c protein as probe, as well as witii purified c-myc. This can show that 
recombinant Lactobacilli carrying S-proteins with inserted antigens can be used as 
vaccine carriers. 
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CLAIMS 

1 • A modified bacterial surface layer (S-layer) protein, the modification 
comprising the internal insertion of a heterologous polypeptide. 

2. The protein according to claim 1 wherein: 

(a) the unmodified protein is fi'om a Gram positive or non-aquatic 
bacteria, optionally a lactic acid bacteria or Lactobacillus; and/or 

(b) the heterologous polypeptide is a functional polypeptide or a 
polypeptide of interest, optionally a binding or targeting protein (such as an antigen, 
antibody, or part thereof). 

3. A protein according to claim 1 or 2 wherein: 

(c) flie protein retains most of the full length sequence of the 
unmodified S-layer protein; 

(d) the polypeptide is inserted at an intemal location at least five 
amino acids firom the C or N terminus; and/or 

(e) the modified protein has a size of firom 40 to 70 kDa. 

4. A protein according to any preceding claim which: 

(f) has a crystallisation or C-terminal domain that is 
predominantly basic, or hydrophobic, 

(g) an N-tenninal domain which is either predominantly 
hydrophilic; or 

(h) has altemating hydrophobic and hydrophilic regions. 

5. A protein according to any preceding claim wherein the heterologous 
polypeptide is inserted at a location in the protein either so that it is : 

(i) exposed, or present on the cell surface; 

(j) present in the surface layer, or the cell wall; 

^) is protected fi^om ext^ial proteolytic processing or is not recognised 
or bound by external antibodies. 

6. A protein according to any preceding claim wherein the modified or 
immodified protein: 
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(1) crystallises, optionally rato an oblique lattice (such as of pi or 

p2 symmetry); 

(m) has a cell wall anchor domain; 
(n) has a pi of at least 7; and/or 
(o) is predominantly basic. 

7. A protein according to any preceding claim wherein the polypeptide 
comprises an antigen causing or specific for a disease, and optionally is an antigen 
recognisable by an antibody or is all or part of an antigen from an anaerobic bacteria, 
optionally Clostridium. 

8. A protein according to any preceding claim which (in unmodified 
form) is fi-om Lactobacilltis acidophilus^ crispatus, helveticus, amylovorus, or 
gallinarum. 

9. A jBragment of a bacterial surface layer (S-layer) protein which is: 

a) an N-terminal fragment or a fragment that is capable of forming a 
dimer with an other such fragment or a trimer with two other such fragments; 

b) capable of forming dimers with another such fragment and either 

(i) includes an innnxmodominant or exposed loop region and is 
from 20 to 200 amino acids long; or 

(ii) excludes an entire immunodominat or exposed loop region and 
is from 20 to 155 amino acids long. 

10. A polynucleotide encoding a protein according to any of claims 1 to 9. 

11. A vector comprising a polynucleotide according to claim 1 0. 

12. A host cell comprising, or which is has been transformed with a 
vector according to claim 1 1 . 

13. A bacteria expressing a surface layer protein (or fragment) according 
to any of claims 1 to 9. 

14. A bacteria according to claim 13 which is a lactic acid bacteria, 
optionally from Lactobacillus , and is. preferably X. plantarum^ L. acidophilus or L. 
casei, 

15. A modified bacteria (other than L. casei or Bacillus) which has been 
modified to express a heterologous surface layer (S-layer) proteiiL 
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16. A bacteria according to claim 1 5 which would not nonnally, or as a 
wild-type or in unmodified form does not, possess a surface layer. 

17. A modified bacteria according to claim 1 5 or 1 6 which is a 
Lactobacillus cell and/or the S-layCT has its own, original, cell wall anchor. 

18. A bacteria according to any of claims 1 5 to 17 which is a 
Lactobacillus bacterial cell, such as L. casei, and/or the S-layer protem is firom 
Lactobacillus bacteria, such as L, acidophilus. 

19. An Z. casei bacterial cell expressing a bacterial surface layer 
(S-layer) protein that is either not from Z. crispatus or is not a collagen bidning 
protein. 

20. An L. casei cell according to claim 19 wherein the S4ayer protein is, 
or is derived, fromL. acidophilus. 

21 . A modified bacteria e>q)ressing only, or homogeneously, a 
heterologous or modified surface layer (S-layer) protein. 

22. A bacteria according to claim 21 having a genome which includes a 
polynucleotide encoding a heterologous S-layer protein, optionally integrated into 
the genome, and/or where the polynucleotide encoding the normal or wild-type 
S-layer protein has been silenced, replaced, switched off or otherwise rendered 
non-expressed. 

23. A bacteria according to claim 22 or 23 wherein the heterologous or 
modified S-layer protein is the sole or only S-layer protein expressed by the bacterial 
cell and/or the cell does not express any wild-type S-layer protein. 

24. A bacteria according to any of claims 22 to 24 wherein the S-layer 
protein is located on the surface of the cell wall and/or a multipHcity of S-layer 
proteins form an S-layer, 

25. A vaccine comprising a bacteria according to any of claims 13 to 24. 

26. A vaccine according to claim 25 which is an oral or nasal vaccine 
and/or additionally comprises an adjuvant. 

27. A sheet or (optionally crystalline) monolayer or 2-dimensional array 
comprising a plurality of bacterial surface layer proteins, at least one of which is 
modified protein according to any of claims 1 to 8. 
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28, A solid surface, liquid-air interface, lipid film, liposome or solution 
comprising a sheet, monolayer or array according to claim 27. 

29. A solid surface according to claim 28 to which is boimd one or more 
(macro) molecules, such as an enzyme, antibody or other binding molecule, receptor, 
antigen or Ugand. 

30, A soUd surface comprising a layer of S-proteins, at least a plurality of 
which are modified proteins accordiag to any one of claims 1 to 8, sandwiched 
between the surface and a layer of functional molecules. 

31. A sensor, molecular sieve or ion trap comprising a sheet, layer or 
array according to claim 27 or a surface according to any of claims 28 to 30. 
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<140> 
<141> 

<160> 33 

<170> Patentin version 3.1 

<210> 1 

<211> 1300 

<212> DNA 

<213> L. acidophilus 

<400> 1 



at^aagaaaa 


atttaagaat 


cgttagcgct 


gctgctgctg 


crttacttgc 


tgttgctcca 


60 




ctgctgtatc 


tactgttagc 


gctgctacta 


ctattaacgc 


aagttcatca 


120 


gcaatcaata 


ccaacactaa 


tgctaagtac 


gatgttgatg 


taactcctag 


tgtttctgca 


180 


gttgctgcaa 


atactgctaa 


caacactcca 


gctattgccg 


gtaaccttac 


tggtactatt 


240 


tcagcaagtt 


acaatggtaa 


gacttatact 


gctaacttaa 


aggcagatac 


tgaaaatgcc 


300 


actattactg 


ctgctggtag 


cactactgcc 


gttaaacctg 


ctgaattagc 


tgcaggtgtg 


360 


gcttacactg 


taactgttaa 


cgatgcttca 


tttaacttcg 


gttcagaaaa 


tgcaggtaag 


420 


actgttaccc 


ttggttcagc 


taactcaaat 


gtaaaattca 


ccggtacaaa 


cagtgataat 


480 


caaactgaaa 


ctaatgtttc 


tactttgaaa 


gttaagttag 


accaaaacgg 


tgttgcttca 
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cttactaatg 


tttcaattgc 


aaacgtatac 


gcaattaaca 


ctactgataa 


cagtaacgta 


600 


aacttctacg 


acgtaactag 


tggtgctact 


gtaactaacg 


gtgccgtttc 


agttaatgct 


660 


gataaccaag 


gtcaagttaa 


tgttgcaaac 


gtagttgcag 


caattaattc 


aaaatacttt 


720 


gcagcacaat 


acgcagataa 


gaagttaaat 


actcgtactg 


ctaatactga 


agatgctatt 


780 


aaggcagcct 


t-aaaggacca 


aaagattgat 


gtaaactcag 


taggttactt 


caaagcacct 


840 


catactttca 


ctgttaacgt 


taaagcaact 


tcaaatacta 


atggtaagtc 


agctactttg 


900 


ccagtagttg 


ttactgttcc 


taatgttgct 


gagccaactg 


tagccagcgt: 


aagcaagaga 


960 


attatgcaca 


acgcatacta 


ctacgacaag gacgctaagc 


gtgttggtac 


tgacagcgtt 
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Phe Ala Ala Gin Tyr Ala Asp Lys Lys Leu Asn Thr Arg Thr Ala Asn 
210 215 220 



Thr Glu Asp Ala 
225 

Asn Ser Val Gly 



Lys Ala Thr Ser 
260 

Val Thr Val Pro 
275 



lie Lys Ala Ala 
230 

Tyr Phe Lys Ala 
245 

Asn Thr Asn Gly 



Asn Val Ala Glu 
280 



Leu Lys Asp Gin 
235 

Pro His Thr Phe 
250 

Lys Ser Ala Thr 
265 

Pro Thr Val Ala 



Lys lie Asp Val 

240 

Thr Val Asn Val 
255 

Leu Pro Val Val 
270 

Ser Val Ser Lys 
285 



Arg lie Met His 
290 

Gly Thr Asp Ser 
305 

Thr Thr Thr lie 



Lys Ala Val Asp 
340 

Arg Thr Leu Lys 
355 

Ala Asn Lys Val 
370 

Ala Ser Tyr Thr 

385 

Asn Thr Asp Lys 



Asn Ala Tyr Tyr 
295 

Val Lys Arg Tyr 
310 

Asn Gly Lys Thr 
325 

Lys Tyr lie Asn 



His Asn Ala Tyr 
360 

Val Leu Lys Lys 
375 

Phe Lys Asn Gly 
390 

Thr Tyr Val Lys 
405 



Tyr Asp Lys Asp 
300 

Asn Ser Val Ser 
315 

Tyr Tyr Gin Val 
330 

Ala Ala Asn lie 
345 

Val Tyr Ala Ser 



Gly Glu Val Val 
380 

Gin Lys Tyr Tyr 
395 

Val Ala Asn Phe 
410 



Ala Lys Arg Val 



Val Leu Pro Asn 
320 

Val Glu Asn Gly 
335 

Asp Gly Thr Lys 
350 

Ser Lys Lys Arg 
365 

Thr Thr Tyr Gly 



Lys He Gly Asp 
400 

Arg 



<210> 3 

<211> 18 

<:212> PRT 

<213> ARTIFICIAL SEQtJENCS 
<220> 

<223> INSERTION LINKER 



<400> 



Arg Gly Ser Eis His His His His His Gly Ser Gly Thr Ala Thr Thr 
15 10 15 



He Asn 
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<400> 7 

Tyr Asp Val Thr Ser Ala Met Ala Ala Ala Arg Ala Ser Gly Ala Tlir 
15 10 15 

Val Thr 



<210> 8 

<211> 18 

<212> PRT 

<213> ARTIFICIAL SEQUENCE 
<220> 

<223> INSERTION LINKER 

<400> 8 

Asn Ala Ala Asn lie Val Ala Met Ala Ala Ala Arg Gly Asp Gly Thr 
1 5 ' 10 15 

Lys Arg 



<210> 9 
<211> 18 
<212> PRT 

<213> ARTIFICIAL SEQUENCE 
<22Q> 

<223> INSERTION LINKER 
<400> 9 

Pro Ser Val Ser Ala Ser Met Ala Ala Ala Arg Gly Ala Val Ala Ala 
15 10 15 

Asn Thr 



<210> 10 
<211> 17 
<212> PRT 

<213> ARTIFICIAL SEQUENCE 
<220> 

<223> INSERTION LINKER 
<400> 10 

Leu Lys Ala Asp Thr Met Ala Ala Ala Arg Asn Thr Glu Asn Ala Thr 
15 10 15 



He 
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<223> INSERTION LINKER 
<400> 14 

Thr Thr lie Asn Ala Glu Gin Lys Leu lie Ser Glu Glu Asp Leu Asn 
1 5 10 ' 15 

Leu Glu Ser Ser Ser Ala lie. 
20 



<210> 15 
<211> 19 
<212> PRT^ 

<213> ARTIFICIAL SEQUENCE 
<220> 

<22 3> INSERTION LINKER 
<400> 15 

Ala Met Val Asn Glu Gin Lys Leu lie Ser Glu Glu Asp Leu Asn Ala 
15 10 15 

Arg Gly Asn 



<210> 16 
<211> 19 
<212> PRT 

<213> ARTIFICIAL SEQUENCE 
<220> 

<223> INSERTION LINKER 
<400> 16 

Ala Met Val Asn Glu Gin Lys Leu lie Ser Glu Glu Asp Leu Asn Ala 
15 10 15 

Arg Ala Gly 



<210> 17 

<211> 19 

<212> PRT 

<213> ARTIFICIAL SEQUENCE 
<220> 

<223> INSERTION LINKER 

<400> 17 



Thr Met Val Asn Glu Gin Lys Leu lie Ser Glu Glu Asp Leu Asn Ala 
15 10 15 
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PRIMER 



<400> 22 

gcgcgaattc agatctatcg tggtaagtaa taggacgtg 



39 



<210> 23 

<211> 31 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 
<220> 

<223> PRIMER 

<400> 23 
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<210> 24 

<211> 19 
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<223> PRIMER 

<400> 24 
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<210> 25 

<211> 21 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 
<220> 

<223> PRIMER 

<400> 25 

gcgtttaaat cttcttctga a 



<210> 26 

<211> 39 

<212> DNA 

<213> ARTIFICIAL SEQUENCE 
<220> 

<223> PRIMER 



<400> 26 

ggggggatcc ggtaccgcra ctactattaa cgcaagttc 



39 



<210> 
<211> 
<212> 



27 
48 
DNA 
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<211> 39 
<212> DNA 

<213> ARTIFICIAL SEQUENCE 
<220> 

<223> PRIMER 
<400> 32 
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